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Foreword 


This book consists of a series of essays on non-Euclidean geometry in a broad sense, 
including the classical geometries of constant curvature (spherical and hyperbolic), 
but also de Sitter, anti-de Sitter, co-Euclidean, co-Minkowski, Hermitian geometries, 
and some others. A few central themes emerge from these essays, namely, trigonom- 
etry, convexity, rigidity, area, volume, and the relation between non-Euclidean geom- 
etry and projective geometry, in particular the connection with conics, quadrics and 
quadratic forms. Some essays deal with very classical questions and others address 
problems that are at the heart of present day research, but all of them are concerned 
with fundamental topics. 

The reader will see that the various essays contain many references to old math- 
ematical texts. This is a result of the fact that geometry is a rich world which is built 
up on a socle of knowledge acquired through the effort of many generations. For 
the authors of this foreword, reading primary sources has always been a rewarding 
activity: it is in these original texts that one can find the most profound ideas that 
dominate modern research. 

Although the topics considered in the various essays that constitute this volume 
are interconnected, the essays are completely self-contained. They vary in terms of 
length and difficulty. Sometimes, the same topic is treated in more than one of them, 
with more or less details, but the points of view are often different. The essays should 
be useful to researchers and students of non-Euclidean geometry. They are intended 
to be references for the various topics they present. 

We acknowledge support from the U.S. National Science Foundation grants DMS 
1107452, 1107263, 1107367 “RNMS: GEometric structures And Representation va- 
rieties” (the GEAR Network) and we warmly thank Marco Zunino for his excellent 
editorial work. Part of this work was done while the second editor was visiting the 
Yau Mathematical Sciences Center of Tsinghua University (Beijing). 


Vincent Alberge (New York), 
Athanase Papadopoulos (Strasbourg and Beijing), 
December 2018 


Introduction 


Classically, the expression “non-Euclidean geometry” refers to the two geometries 
of non-zero constant curvature, namely, the spherical and the hyperbolic.! Spheri- 
cal geometry was developed in Greek antiquity and it attained there a high degree 
of matureness, especially in the work of Menelaus of Alexandria (1‘'-2" centuries 
A.D.).2, Hyperbolic geometry is a nineteenth century achievement. It was discov- 
ered by Lobachevsky, Bolyai and Gauss, with a few anticipated results obtained in 
the eighteenth century, especially in the work of J. H. Lambert who developed a ge- 
ometry (which, from his point of view, was hypothetical), in which all the Euclidean 
axioms hold except the parallel postulate and where the latter is replaced by its nega- 
tion. Although Lambert’s goal was to find a contradiction in the consequences of this 
system of axioms (like many other prominent geometers, he thought that such a ge- 
ometry cannot exist), the net result of his investigations is a collection of interesting 
theorems in hyperbolic geometry.? 

In the present volume, the term “non-Euclidean geometry” is used in a broader 
sense, including geometries such as de Sitter, anti-de Sitter and others which can be 
developed, in analogy with the three classical geometries, in the framework of projec- 
tive geometry, using the language of conics, quadrics and quadratic forms. Studying 
the three classical geometries in the setting of projective geometry is also a nineteenth 
century achievement; it is due to the visions of Cayley, Klein, Beltrami, Poincaré, and 
a few others. 

Today, non-Euclidean geometry in this generalized sense is a very active research 
field and it seemed to us that editing a book containing self-contained surveys touch- 
ing several aspects of this domain was desirable for researchers and those who want 
to learn this subject. This is the reason for which this book is published. 


' The expression “non-Euclidean geometry,” to denote spherical and hyperbolic geometry, was 
coined by Gauss. 

2? The first English edition of Menelaus’ work just appeared in press: R. Rashed and A. Papadopou- 
los, Menelaus’ Spherics: early translation and al-Mahani/al-Harawi’s version, Scientia Graeco-Arabica 21, 
Walter de Gruyter, Berlin, 2017 

3 J. H. Lambert, “Theorie der Parallellinien,’ in Die Theorie der Parallellinien von Euklid bis auf 
Gauss, eine Urkundensammlung zur Vorgeschichte der nichteuklidischen Geometrie (P. Stackel and F. En- 
gel, eds.), B. G. Teubner, Leipzig 1895. French translation by A. Papadopoulos and G. Théret, La théorie 
des lignes paralléles de Johann Heinrich Lambert, Sciences dans |’Histoire, Blanchard, Paris, 2014. There 
is no English translation available of this work. 
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Geometry is one of these mathematical fields (probably the unique one to such 
a great extent) where the texts of the ancient great authors keep their full value, and 
going through them is not a matter of vain curiosity nor a matter of being interested 
in history, but it is the best way to understand present day mathematics. In this sense, 
the rapid growth of non-Euclidean geometry in the last few decades, after its revival 
by Thurston in the 1970s, is also a return to the sources of geometry. This is why 
the names of Menelaus, Pappus, Euler, Lambert, Lagrange, Lexell, Fuss, Schubert, 
Chasles, Study, Story, and several others are mentioned at several occasions in the 
various essays that constitute this volume. Going throughout these works also shows 
how slow is the process of development of geometry, despite the rapid (quantitative) 
growth of the literature in this field. The slow evolution process of mathematics is 
marked out by abrupt changes attached to names such as those we just mentioned. 

Before describing in some detail the content of this volume, we would like to make 
a few remarks on two topics that are at the heart of the various surveys that constitute 
it, namely, the notion of area in non-Euclidean geometry, and the geometry of conics. 

In spherical geometry, the area of a triangle is (up to a constant multiplicative 
factor) the excess of its angle sum with respect to two right angles. In other words, 
the area of a triangle with angles A, B, C is equal to A+ B+ C —z, up to a constant 
factor that does not depend on the choice of the triangle. In hyperbolic geometry, 
the area of a triangle is (again up to a constant factor) the deficiency of the angle 
sum with respect to two right angles, that is, m — (A + B+ C). The fact that in 
spherical (respectively hyperbolic) geometry the angle excess (respectively the angle 
deficiency) of an arbitrary triangle is positive, is probably the most important feature 
in that geometry. As a matter of fact, there exist classical proofs of the trigonometric 
formulae in spherical and hyperbolic geometries that are exclusively based on these 
properties. The oldest such proofs that we are aware of are due to L. Gérard,* a 
student of Poincaré. Since the trigonometric formulae contain in essence all the geo- 
metric information on a space, spherical and hyperbolic geometry are essentially the 
geometries where angle excess or, respectively, angle deficiency of all triangles are 
positive. Thus, it is not surprising that several chapters in the present volume concern 
the notion of area and its use in non-Euclidean geometry. 

After area, one naturally considers volume in higher-dimensional spherical and 
Euclidean geometries, and here we enter into the realm of difficult problems. One 
might recall in this respect that Gauss, in a letter to Wolfgang Bolyai (the father of 
Janos Bolyai, the co-discoverer of hyperbolic geometry), dated March 6, 1832,° after 
he gave an outline of a proof of the area formula for a hyperbolic triangle as angle defi- 
ciency, asked his friend to suggest to his son to work on the determination of volumes 

4L. Gérard, Sur la géométrie non euclidienne, Thése Ne 768, Faculté des Sciences de l’Université de 
Paris, Gauthier-Villars, Paris, 1892. 


5 See Gauss’s Collected works, Vol. 6, p. 221, and the article by P. Stickel and F. Engel “Gauss, Die 
beiden Bolyai und die nichteuklidische Geometrie,” Math. Annalen 2 (1897), 149-167. 
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of tetrahedra in three-dimensional hyperbolic space. It is conceivable that Gauss was 
unaware of the fact that this subject was extensively studied by Lobachevsky dur- 
ing the same period, but he probably knew from his own experience that obtaining 
formulae for volumes of tetrahedra in hyperbolic and spherical spaces is a difficult 
problem. It turns out indeed that there is no simple formula for the volume of a 
hyperbolic (or spherical) tetrahedron in terms of the dihedral angles that is compa- 
rable to the formula for the area of a triangle. The existing formulae (first obtained 
by Lobachevsky) give expressions of volume in terms of an integral function which 
bears the name Lobachevsky function and which is closely connected to the Euler 
dilogarithm function. These formulae involve the value of the Lobachevsky function 
at the dihedral angles of the tetrahedron. 

Several computations of volumes of tetrahedra were conducted after Lobache- 
vsky’s work but the progress was slow. Some of the most important conjectures that 
are open in three-dimensional geometry of hyperbolic (or spherical) space concern 
volume. We mention incidentally that the computation of volumes of hyperbolic 
tetrahedra is the subject of Chapter 8 of Thurston’s notes on three-dimensional ge- 
ometry.® 

Talking about volume, let us also mention that there are interesting formulae for 
volumes of Euclidean tetrahedra that are due to Euler. The latter, in a letter to his 
friend Christian Goldbach, dated November 14, 1750,” announced a formula for the 
volume of a simplex in terms of its side lengths, a three-dimensional analogue of 
Heron’s formula for the area of a triangle.* Later on, he gave the proof of that formula, 
in a paper published in 1758.° In the same paper, he provided several other formulae 
for volumes of Euclidean simplices, including a formula in terms of side lengths, and 
another one in terms of lengths of the three edges abutting on a solid angle along with 
the three plane angles that these sides form. 

Finally, let us note that computing area and volume, beyond those of polygons 
and polyhedra, is another subject that has a long history whose origin can be traced 
back to the work of Hippocrates of Chios (5'" century B.C.) who studied areas of 
lunes (intersections of two discs) and other figures, and which was brought to a high 
degree of sophistication in the work of Archimedes (3™ century B.C.). The reader 
can imagine that computing areas and volumes without the use of the mechanical 
methods of modern calculus is always a challenging problem. 


© W. P. Thurston, Geometry and topology of three-manifolds, Princeton Lectures Notes, Princeton, 
N.J., 1979. 

7 Letter Ne 149 in Euler’s volume of his Opera omnia containing his correspondence with Goldbach, 
Series quarta A, Vol. IV, Birkhauser, Basel, 2014. 

8 Heron’s formula is contained in the Codex costantinopolitanus Palatii Veteris, ed. and transl. by 
E. M. Bruin, Leiden, 1964. 

°L. Euler, Demonstratio nonnullarum insignium proprietatum, quibus solida hedris planis inclusa 
sunt praedita, Novi Commentarii academiae scientiarum petropolitanae 4 (1752), 140-160, in Opera om- 
nia, Series 1, Vol. XXVI, 94-108. 
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Let us pass to our second topic, projective geometry and conics. We note right 
away that the study of conics dates back to Greek antiquity, where the most important 
treatise on the subject is the multi-volume work of Apollonius of Perga (3"-2™ c. 
B.C.),!° which remained until the seventeenth century one of the most fundamental 
and most read mathematical texts among mathematicians. A conic is the intersection 
of a plane with a right cone whose basis is circular. If we think of projective geometry 
as the study of properties of figures that are invariant by projections, it is in the study 
of conics that we find the root of this field: the cone vertex is the center of projection. 
The notion of cross-ratio as a projective invariant also dates back to Greek antiquity, 
it is used in Menelaus’ Spherics.? 

In his 1859 paper “A sixth memoir upon quantics,”!! Cayley included the study 
of Euclidean and spherical geometries in the setting of projective geometry, where 
the ground space for each of these geometries is the interior of an appropriate conic 
in projective space, the conic itself becoming the “absolute” of the space. He noted 
that the distance function in such a geometry may be defined in terms of the log- 
arithm of the cross-ratio. In the same paper, Cayley made his famous statement 
that “descriptive geometry is all geometry,”!? an idea which was taken up by Klein 
later on, who, in his 1871 paper “Uber die sogenannte Nicht-Euklidische Geometrie” 
(On the so-called non-Euclidean geometry), '!> included hyperbolic geometry as well 
in the picture, using different kinds of conics and quadrics in dimensions 2 and 3 
respectively. In this work, Klein also gave formulae for the distance functions using 
the cross-ratio in the Euclidean and the two classical non-Euclidean geometries. 

Among the various attempts to generalize hyperbolic geometry using the geom- 
etry of quadrics, we may mention a section in a paper of Poincaré which he wrote 
in 1887, called “Sur les hypothéses fondamentales de la géométrie” (On the funda- 
mental hypotheses of geometry).'* This paper is poorly known to mathematicians. 
The section which interests us here is concise, it contains ideas with no attempts for 
precise statements and proofs. We provide an English translation of it in the prologue 
of this volume, after the present introduction. 

'!0 The authoritative version is the one by R. Rashed, in 4 volumes: Les Coniques, Tome 1, Livre I, 
de Gruyter, Berlid 2008; Tome 2, Livre IV, ibid., 2009; Tome 3, Livre V, ibid., 2008; Tome 4, Livres VI 
et VII, ibid, 2009. 

"| A. Cayley, “A sixth memoir upon quantics,” Phil. Trans. R. Soc. Lond. 149 (1859), 61-90. Reprinted 
in Vol. II of Cayley’s Collected mathematical papers. 

'2 “Descriptive geometry” is the name Cayley used for projective geometry. 

13 F, Klein, “Uber die sogenannte Nicht-Euklidische Geometrie,” Vorgelegt von A. Clebsch, Nachri- 
cheten von der Kgl. Gesellschaft der Wissenschaften zu Géttingen, Ne 17 (30 August 1871). French version, 
“Sur la géométrie dite non euclidienne,” translated by J. Hotiel, Bull. sci. math. et astr. 2 (1871), 341-351. 
See also the commentary in N. A’Campo and A. Papadopoulos, “On Klein’s So-called non-Euclidean 
geometry,” in Sophus Lie and Felix Klein: the Erlangen program and its impact in mathematics and in 
physics (L. Ji and A. Papadopoulos, eds.), EMS Publishing House, Ziirich, 2015, 91-136. 


'4H. Poincaré, “Sur les hypothéses fondamentales de la géométrie.” Bull. Soc. Math. France 15 
(1887), 203-216. 


Introduction XI 


Eduard Study, in a paper published in 1907!° whose translation is contained in 
the present volume, included in the projective setting other geometries than the three 
classical geometries. 

In the rest of this introduction, we will give a quick survey of the various essays 
that constitute the present volume. We have divided this collection of essays into three 
parts. The first part (Chapters 1-12) is the longest, and it is concerned with spherical 
and hyperbolic geometries. The second part (Chapters 13-16) deals with geometries 
defined in the setting of projective geometry. The third part (Chapters 17 and 18) 
concern two other geometries, namely, Hermitian geometry, that is, the geometry of 
complex projective spaces, and an axiomatic plane geometry which is termed “non- 
elliptic metric plane in which every segment has a midpoint.” 

The content of each chapter of this volume is now described in detail. 


Part I. Spherical and hyperbolic geometries 


Chapter 1, written by Norbert A’?Campo and Athanase Papadopoulos, is a survey of 
classical material about area in spherical and hyperbolic geometry. A theorem of 
Albert Girard stating that the area of a spherical triangle is equal (up to a constant 
multiple) to its angle sum, as well as its analogue in hyperbolic geometry, are re- 
viewed. A formula due to Euler for the area of a spherical triangle in terms of its side 
lengths, again with its analogue in hyperbolic geometry, are used in order to give an 
equality for the distance between the midpoints of two sides of a spherical (respec- 
tively hyperbolic) triangle in terms of the third side. These equalities are quantitative 
versions of the formula expressing the fact that the sphere (respectively hyperbolic 
plane) is positively (respectively negatively) curved in the sense of Busemann. The 
essay contains further results related to area in non-Euclidean geometry together with 
some historical comments. 

Chapter 2, by Elena Frenkel and Weixu Su, is based on variational methods intro- 
duced by Euler in non-Euclidean geometry. The authors provide detailed proofs of 
hyperbolic analogues of spherical results obtained by Euler using these methods. This 
includes a derivation of the trigonometric formulae and an area formula for hyper- 
bolic triangles in terms of their side lengths. Euler considered the use of variational 
methods in spherical geometry as an application of the techniques of the calculus of 
variations, a field of which he had laid the foundations. !© 

15 E, Study, “Beitrage zur nichteuklidische Geometrie,” L-IIL., Amer. J. Math. 29 (1907), 101-167. 

'e Cf. L. Euler, “Principes de la trigonométrie sphérique tirés de la méthode des plus grands et des 


plus petits,” Mémoires de l’Académie royale des sciences et belles-lettres 9 (1755), 223-257, in Opera omnia, 
Series 1, Vol. XXVI, 277-308. 
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In Chapter 3, Elena Frenkel and Vincent Alberge study the hyperbolic analogue 
of a problem in spherical geometry solved by Friedrich Theodor von Schubert, a 
young geometer who became a successor of Euler at the Saint Petersburg Academy 
of Sciences and who worked on spherical geometry. Schubert’s results include the 
determination of the loci of the vertices of triangles satisfying some given conditions, 
in the spirit of problems solved by Euler and his other young collaborators Anders 
Johan Lexell and Paul Heinrich Fuss. In this chapter, the authors give a solution in the 
hyperbolic setting of a problem solved by Schubert on the sphere, namely, to find, 
for a triangle with prescribed base and whose vertex varies on a given hypercycle 
consisting of points equidistant to the line containing the base, the point(s) for which 
the area of this triangle is maximal or minimal. They provide two different solutions 
of this problem. 


The reader may note that in the Euclidean case, if we fix two vertices of a triangle 
and if we look for a family of triangles having the same area, the locus of the third 
vertex is a straight line parallel to the base of the triangle (in fact, in the Euclidean 
plane, hypercycles are straight lines). The situations in the hyperbolic and spherical 
cases are different from the Euclidean case since in these cases, the area of a triangle 
is not determined by a side and the corresponding altitude. The desired locus in these 
cases is not a curve equidistant to the basis of the triangle. It is interesting to note 
in this respect that Herbert Busemann solved in 1947 the following related problem: 
“To characterize the geometries in which the following property, which is satisfied in 
Euclidean geometry, holds: The area of a triangle ABC depends only on the length 
of BC and the distance from A to the segment BC.’ Busemann found that the only 
geometries that satisfy this property are what he called the Minkowski planes (that 
is, the metric spaces underlying the 2-dimensional normed vector spaces) in which 
the perpendicularity relation is symmetric (for an appropriate notion of perpendicu- 
larity). !7 

Chapter 4, by Himalaya Senapati, concerns medians in non-Euclidean geometry. 
A result due to J. H. Lambert, from his Theorie der Parallellinien (Theory of par- 
allel lines), written in 1766,° says that for any equilateral triangle ABC with medi- 
ans AA’, BB’ and CC’ intersecting at O, we have OA’ = 4 AA’, OA’ > + AA’ 
and OA’ < + AA’ in Euclidean, spherical and hyperbolic geometry respectively. 
Senapati shows, using non-Euclidean trigonometry, that Lambert’s inequalities hold 
for arbitrary triangles. He also presents a collinearity result in spherical geometry, as 
an application of Menelaus’ theorem. 


'7 Cf. Theorem 50.9 in H. Busemann, The geometry of geodesics, Academic Press, New York, 1955, 
and H. Busemann, “Two-dimensional geometries with elementary areas,” Bull. Amer. Math. Soc. 53 
(1947), 402-407, reprinted in H. Busemann, Selected works (A. Papadopoulos, ed.), Vol. 1, Springer Ver- 
lag, Cham, 2018, 379-384. 
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Chapter 5 is also due to Himalaya Senapati. It concerns the construction in spher- 
ical geometry of a triangle whose three vertices lie on a given circle and whose three 
sides produced pass through three given points. Such a construction, in the Euclidean 
case, is a theorem in Pappus’ Collection,'* in the special case where the three points 
are aligned. Euler generalized the problem to the case where the three points are not 
necessarily aligned and he gave a construction in this general case in the Euclidean 
as well as in the spherical setting.!° In the same volume in which Euler’s solution ap- 
peared, his young collaborator Nicolaus Fuss?° published another proof of the same 
construction for the Euclidean case.*! Lagrange found a new short proof for the Eu- 
clidean case, reducing the problem to finding roots of quadratic equations that can be 
solved using ruler and compass.?? In Senapati’s essay, Lagrange’s solution is adapted 
to the non-Euclidean setting. 


The notion of constructibility by ruler and compass is at the root of geometry. 
It underwent several transformations, from the Greeks to Galois, passing through 
Descartes, Gauss, and others. Senapati’s construction on the sphere, in Chapter 5, 
based on the Euclidean Lagrange solution of the generalized problem of Pappus, is 
in the tradition of Descartes, that is, reducing the construction problem to solutions 
of quadratic equations. 


We note incidentally that the first time Descartes introduced the xy coordinates 
(what we call the “Cartesian coordinates”) is in Book I of his Géométrie, precisely 
in the solution of a construction problem of Pappus. We also recall that the title of 
this book is “Des problémes qu’on peut construire sans y employer que des cercles 
et des lignes droites” that is, “On problems that can be constructed using only circles 
and straight lines.” 


Chapter 6 is again due to Himalaya Senapati. It is based on two propositions 
of Menelaus’ Spherics that are comparison results for angles in a triangle cut by a 
geodesic arc joining the midpoints of two sides. The two propositions, in Menelaus’ 


'8 Pappus dAlexandrie, La collection mathématique, (Euvre traduite pour la premiere fois du grec 
en frangais par P. Ver Eecke, Desclée de Brouwer, Paris and Bruges, 1933, Propositions 105, 107, 108, 
and 117 of Book VII. 

'9 L. Euler, “Problematis cuiusdam Pappi Alexandrini constructio,” Acta Academiae scientarum im- 
perialis petropolitanae, Pars 1 (1780), 91-96. In Opera omnia, Series 1, Vol. XXVI, 237-242. 

20 Nicolaus Fuss (1755-1826) was initially Euler’s secretary, and he became later his student, collab- 
orator, and colleague at the Russian Academy of Sciences, and eventually the husband of his grand- 
daughter Albertine. 

21.N. Fuss, “Solutio problematis geometrici Pappi Alexandrini,” Acta Academiae scientarum imperi- 
alis petropolitanae, Pars 1 (1780), 97-104. 

22 J.-L. Lagrange, “Solution algébrique d’un probleme de géométrie,” in Oeuvres de Lagrange, 
Vol. IV, Gauthier-Villars, Paris 1868, 335-339. There is a confusion about the authorship of this work. 
It first appeared in a memoir under the name of J. de Castillon (Giovanni Francesco Mauro Melchiorre 
Salvemini, also called il Castiglione, after his birthplace, Castiglione del Valdarno), in which the latter de- 
clares that this proof is due to Lagrange, and after that, he provides his own proof, which is different from 
Lagrange’s. The part of Castillon’s memoir that corresponds to Lagrange’s proof appeared eventually in 
Lagrange’s Euvres under his own name. 
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treatise, belong to a group of propositions in which the author proves for spherical 
triangles the property called today the Busemann property for positively curved met- 
ric spaces. Senapati gives an improved version of the angle comparison result in the 
case of spherical triangles as well as a version for hyperbolic triangles. 

In Chapter 7, Dmitriy Slutskiy gives a proof of the trigonometric formulae of 
spherical and hyperbolic geometries using a kinematic method, following the work 
of Joseph Marie de Tilly (1837-1906). This approach is explained in a memoir by 
the latter, published in 1870 under the title Etudes de mécanique abstraite (Studies 
in abstract mechanics).?> The basic tools used are two functions that de Tilly calls 
eq(r) and circ(r) respectively, the first one being the length of a curve at distance 
r from a geodesic of length one, and the second one being the length of a circle of 
radius r. The work is model-free. 

Chapter 8, by Son Lam Ho, is concerned with the classical Gauss—Bonnet formula 
for surfaces of variable curvature. This is again a result on area. It says that the 
integral of the Gaussian curvature over a closed surface in the Euclidean space R?, 
with respect to the area form, is equal to 27 times the Euler characteristic of the 
surface. This formula is one of the first formulae that establish a relation between, 
on the one hand, topology (the Euler characteristic) and on the other hand, geometry 
(curvature and area). The author explains the relation between the Gauss—Bonnet 
formula and the formula for the area of a triangle in terms of angle excess. 

Chapter 9, by Charalampos Charitos and Ioannis Papadoperakis, is based on a 
theorem of Euler on cartography. Euler’s result says that there is no perfect map 
from an open subset of the sphere into the Euclidean plane. Here, a perfect map is a 
smooth map that preserves distances infinitesimally along the meridians and parallels 
as well as angles between these lines. The theorem is generally quoted with a different 
(and wrong) statement in the literature by historians of mathematics (and sometimes 
mathematicians who followed the historians’ writings) who misunderstood the state- 
ment.?4 In the present essay, Euler’s precise statement and a detailed proof of it are 
provided. 

23 J.-M. de Tilly, Etudes de mécanique abstraite, Mémoires couronnée et autres mémoires publiés par 
lAcadémie Royale de Belgique, Vol. XXI, 1870. 

24 An instance of a wrong quote is contained in R. Ossermann’s “Mathematical mapping from Merca- 
tor to the millenium,” in Mathematical adventures for students and amateurs (D. F. Hayes and T. Shubin, 
eds.), The Mathematical Society of America, Washington (D.C.), 2004, 237-257, Spectrum. Theorem 1 
there, attributed to Euler, states the following: It is impossible to make an exact scale map of any part 
of a spherical surface, an exact scale map meaning, in that paper, a map that preserves distances up to 
scale. The author refers to Euler’s paper “De repraesentatione superficiei sphaericae super plano pub- 
lished” in Acta Academiae scientarum imperialis petropolitanae, Pars 1 (1777), 107-132 (Opera omnia, 
Series 1, Vol. XXVIII, 248-275). The theorem stated in Ossermann’s paper is not what Euler proves. 
This result was known long before Euler, since Greek antiquity, and Euler’s theorem is much more in- 
volved than that. The same error occurs in the paper “Curvature and the notion of space” by A. Knoebel, 


J. Lodder, R. Laubenbache, and D. Pengelley, in Mathematical masterpieces, Springer Verlag, New York, 
2007, 159-227, and in many other papers. 
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Chapter 10, by Charalampos Charitos, is based again on certain projection maps 
from (subsets of) the sphere to the Euclidean plane that were considered by Euler. 
The study concerns more precisely the class of area-preserving projections of class 
C? that satisfy the further property that they send all meridians and parallels to per- 
pendicular curves in the Euclidean plane. This class of maps was highlighted by Euler 
in his memoir “De repraesentatione superficiei sphaericae super plano” (On the rep- 
resentation of spherical surfaces onto the plane).?+ The distortion from conformality 
of these projections is compared with that of the so-called “Lambert cylindrical equal 
area projection,” a map introduced by Lambert and rediscovered by Euler. The author 
shows that the distortion of the Lambert projection has remarkable extremal proper- 
ties in the class considered. 

Thus, Chapters 9 and 10 concern the search of maps between non-Euclidean and 
Euclidean geometries that are best in an appropriate sense. It is also interesting to see 
that by their study of such maps, Euler, Lambert and the geographers that preceded 
them are in fact predecessors of the modern theory of quasiconformal mappings. The 
interested reader may refer to the recent survey entitled “Quasiconformal mappings, 
from Ptolemy’s geography to the work of Teichmiiller.’?> 

Chapter 11, by Nikolai Abrosimov and Alexander Mednykh, consists of two parts. 
The first one concerns area, and the second one volume, in spherical and hyperbolic 
geometries. 

In the first part, the authors survey various classical formulae by Euler, Cagnoli, 
Lhuillier and others on the area of a triangle in spherical geometry and their coun- 
terparts in hyperbolic geometry. They also present non-Euclidean analogues of a 
Euclidean area formula due to Bretschneider and other recent formulae for the area 
of non-Euclidean triangles and quadrilaterals, together with non-Euclidean versions 
and generalizations of an identity of Ptolemy concerning cyclic quadrilaterals, and 
related identities due to Casey, and others. 

In the second part, the authors survey classical formulae for volumes of various 
kinds of polyhedra due to Schlafli, Lobachevsky, Bolyai, Coxeter, and Vinberg. They 
discuss in particular the case of an orthoscheme, that is, a simplex in which the edges 
are mutually orthogonal. A three-dimensional orthoscheme has three right dihedral 
angles, the other dihedral angles being termed esesential. Schlafli provided a formula 
for the volume of a spherical orthoscheme in terms of its essential dihedral angles, 
using a function which is now called the Schlafli function. Lobachevsky and Bolyai 
obtained formulae for volumes of hyperbolic orthoschemes. A theory of volume of 
ideal hyperbolic tetrahedra originated in the work of Lobachevsky done in the second 

25 A. Papadopoulos, “Quasiconformal mappings, from Ptolemy’s geography to the work of Teich- 
miiller,” in Uniformization, Riemann—Hilbert correspondence, Calabi-Yau manifolds, and Picard—Fuchs 


equations (L. Ji and S.-T. Yau, eds.), Advanced Lectures in Mathematics 42, Higher Education Press, 
Beijing, and International Press, Boston, 2018, pp. 237-315. 
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quarter of the nineteenth century. This theory was revived by Thurston and Milnor 
in the late 1970s. Vinberg considered in detail the case of tetrahedra having at least 
one vertex at infinity. 


Together with these developments, the authors in Chapter 11 present several other 
relatively recent results on volume, including a result of Abrosimov on a problem of 
Seidel asking for the expression of the volume of an ideal hyperbolic tetrahedron in 
terms of the determinant and the permanent of its so-called “Gram matrix.” They 
also report on a formula, due to Derevin and Mednykh, expressing the volume of a 
compact hyperbolic tetrahedron in terms of its dihedral angles, and on a formula due 
to Sforza concerning the volume of a compact tetrahedron in hyperbolic or spher- 
ical 3-space. They also present formulae for volumes of spherical and hyperbolic 
octahedra with various kinds of symmetries. 


Chapter 12 by Ivan Izmestiev is concerned with rigidity problems of bar-and- 
joint frameworks in Euclidean, spherical and hyperbolic geometries. A bar-and-joint 
framework is an object made of rigid bars connected at their ends by universal joints, 
that is, joints that allow the incident bars to rotate in any direction. The questions that 
are addressed in this chapter generalize rigidity questions for polyhedra. The author 
approaches them from two points of view, which he calls the static and the kinematic. 
In the scientific jargon, statics is the science of equilibrium of forces, and kinematics 
the science of motion. In the present setting, the static rigidity of a framework refers 
to the fact that every system of forces with zero sum and zero moment applied on it 
can be compensated by stresses in the bars, whereas its kinematic rigidity refers to the 
absence of deformations that keep the lengths of bars constant to the first order. The 
author shows that these two notions of rigidity of a framework are equivalent. More 
generally, he proves that a framework in a Euclidean, spherical, or hyperbolic space 
has the same number of static and kinematic degrees of freedom. Here, the number 
of static degrees of freedom is the dimension of the vector space of unresolvable 
loads, that is, of systems of forces applied to the nodes that cannot be compensated 
by stresses in the bars. In particular, a framework is statically rigid if the number of 
its static degrees of freedom is zero. The number of kinematic degrees of freedom is 
the dimension of the vector space of non-trivial infinitesimal isometric deformations, 
that is, deformations that are not induced by an isometry of the ambient space. 


The subject of statics, in the sense used in this essay, finds its roots in the 19" 
century works of Poinsot, Mobius, Grassmann, and others. Questions of infinites- 
imal rigidity of smooth surfaces were addressed in the 20" century by H. Weyl, 
A. D. Alexandrov, and A. V. Pogorelov. Izmestiev in Chapter 12 provides a projective 
interpretation of statics which allows him to prove a projective invariance property 
of infinitesimal rigidity and to establish a correspondence between the infinitesimal 
motion of a Euclidean framework and its geodesic realization in the two geometries 


Introduction XVII 


of constant nonzero curvature. He refers to the fact that the number of degrees of free- 
dom of a Euclidean framework is a projective invariant as the Darboux—Sauer cor- 
respondence. He also reviews the so-called “Maxwell—Cremona” correspondence 
for a framework in Euclidean and in the two non-Euclidean geometries. This cor- 
respondence establishes an equivalence between the existence of a self-stress of a 
framework (a collection of stresses in its bars that resolves a zero load), a reciprocal 
diagram (a framework whose combinatorics is dual to the combinatorics of a given 
framework), and a so-called “polyhedral lift" (which, in the Euclidean case, is a ver- 
tical lift to 3-space which has the property that the images of the vertices of every 
face are coplanar). 


Part II. Projective geometries 


Chapters 13 and 14 inaugurate the second part of the volume. They consist of a 
translation (made by Annette A’Campo-Neuen) and a short commentary (by Annette 
A’Campo-Neuen and Athanase Papadopoulos) on a paper by Eduard Study, published 
in 1907, entitled “Beitrage zur Nicht-Euklidischen Geometrie. I’’!> In this paper, 
Study introduces what he calls the exterior plane hyperbolic geometry, which turns 
out to be the geometry we call today the de Sitter geometry. This is a geometry of 
the complement of hyperbolic space, when this space is realized as the Cayley—Klein 
disc model sitting in the projective plane. The exterior space is also the space of lines 
in the hyperbolic plane. A distance function (which is not a metric in the sense we 
intend it today) is defined on pairs of points in this exterior space, and the way the 
distance between two points is defined depends on the position of these points, more 
precisely, on whether the line joining them intersects the unit circle, or is tangent to it, 
or is disjoint from it. Study gives a characterization of triangles for which the triangle 
inequality holds, and of triangles for which the reverse triangle inequality (called the 
time inequality) holds. He also discusses the cases where the distance between two 
points is equal to the length of the longest curve joining them. 

It is interesting to see that the ideas expressed by Study in this paper are explored 
in modern research. They appear in the two essays that follow (Chapters 15 and 16). 

Chapter 15 is also due to Ivan Izmestiev. It consists of an exposition of the theory 
of conics in spherical and hyperbolic spaces. The author presents this theory in full 
detail, including classifications of conics from the algebraic and analytic points of 
view, the theory of pencils, the characteristic properties of foci, axes, and centers, the 
bifocal properties of spherical conics, the various concepts of polarity and duality as 
well as the non-Euclidean versions of several results including Poncelet, Brianchon, 
Pascal and Chasles’ theorems, and Ivory’s lemma. The latter states that the diagonals 
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of a quadrilateral formed by four confocal conics have equal lengths. De Sitter space 
is also included in the picture. 

The topic of spherical conics is classical. Back in 1788, Nicolaus Fuss wrote a 
memoir on spherical conics, entitled “De proprietatibus quibusdam ellipses in super- 
ficie sphaerica descriptae” (On some properties of an ellipse traced on a spherical 
surface).2° Michel Chasles made a systematic study of spherical conics in his 1860 
article “Résumé d’une théorie des coniques sphériques homofocales et des surfaces 
du second ordre homofocales” (Summary of a theory of spherical homofocal conics 
and second-order homofocal surfaces).?” William Story, in a paper on non-Euclidean 
geometry published in 1881,?° computed areas of conics in the hyperbolic plane; his 
results are expressed in terms of elliptic integrals. In 1883, he wrote a paper on hy- 
perbolic conics entitled “On non-Euclidean properties of conics.”’?? Today, the sub- 
ject of non-Euclidean conics is almost forgotten, and even, the study of Euclidean 
conics is no more part of the curricula. Izmestiev’s essay, based on the articles by 
Chasles and Story, revives this subject, showing at the same time that Euclidean and 
non-Euclidean geometries (including de Sitter) can be approached via the study of 
(Euclidean) conics. From this point of view, a non-Euclidean conic is represented 
by a pair of quadratic forms on Euclidean space, the first one (assumed to be non- 
degenerate) representing the absolute of a non-Euclidean geometry, and the second 
one being the conic in the non-Euclidean space realized by the first conic. 

In Chapter 16, Francois Fillastre and Andrea Seppi start from the fact, which we 
mentioned several times, that in dimension two, a model for each of the three clas- 
sical geometries can be obtained as the interior of (a connected component of the 
complement of) a certain conic, the latter becoming the “absolute” of the space. The 
authors survey several two- and three-dimensional geometries that may be obtained, 
in the tradition of Klein, in the setting of projective geometry. These geometries in- 
clude, besides the classical Euclidean, spherical and hyperbolic geometries, de Sitter 
and anti-de Sitter geometries, as well as geometries that the authors call co-Euclidean 
and co-Minkowski. The last two are respectively the geometry of the space of hyper- 
planes in Euclidean space and that of space-like hyperplanes in Minkowski geometry. 
Some of these geometries appear as dual to or as limits of other geometries. The de- 
velopments of the theory presented in this essay use duality theory in Euclidean space 
associated to convex sets (polar duality). A convex complementary component of a 
quadric in R? appears as a model of hyperbolic space whereas the other component 
appears as a model for de Sitter space and is also described as the space of hyper- 


26 N. Fuss, “De proprietatibus quibusdam ellipses in superficie sphaerica descriptae,’ Nova acta 
Academiae scientiarum imperialis petropolitanae 3 (1785), 90-99. 

27 M. Chasles, “Résumé d’une théorie des coniques sphériques homofocales et des surfaces du second 
ordre homofocales,” J. Math. Pures Appl. (1860), 425-454. 

28 W. E. Story, “On the non-Euclidean trigonometry,” Amer. J. Math. 4 (1881), 332-335. 

29 W. E. Story, “On non-Euclidean properties of conics,” Amer. J. Math. 5 (1883), 358-382. 
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planes in hyperbolic space. The definition of anti-de Sitter space, as a Lorentzian 
manifold of curvature —1, uses bilinear forms of signature 2. 

Questions regarding transitions between geometries are also addressed in Chap- 
ter 16, and in this setting, elliptic and de Sitter geometries limit to co-Euclidean ge- 
ometry. Likewise, Minkowski and co-Minkowski geometries appear as limit geome- 
tries of classical ones. The authors also discuss connections and volume forms for 
their model spaces, leading to connections and volume forms for the degenerate co- 
Euclidean and co-Minkowski geometries. A map they call “infinitesimal Pogorelov 
map” is introduced, between the hyperbolic and Euclidean spaces, and between anti- 
de Sitter and Minkowski spaces. They then study embedded (especially convex) sur- 
faces in the 3-dimensional model spaces, as well as geometric transitions of surfaces. 
They show that the notion of curvature transits between various geometries, under 
rescaling in co-Euclidean and co-Minkowski geometries. 


Part III. Other geometries 


In Chapter 17, Boumediene Et-Taoui provides a survey of Hermitian trigonometry, 
that is, trigonometry in complex projective space. In particular, he presents the two 
laws of sines and the law of cosines for a triangle in the complex projective plane. 
The subject of Hermitian trigonometry is classical. It was first studied by Blaschke 
and Terheggen in their paper “Trigonometria Hermitiana.”°° The exposition in Chap- 
ter 17 follows in part that of Ulrich Brehm in his article “The shape invariant of tri- 
angles and trigonometry in two point homogeneous spaces.’?! Et-Taoui defines at 
the same time the notion of polar triangle associated to a given triangle in complex 
projective space and he uses it to derive the second law of sines. He also estab- 
lishes relations between invariants introduced by Brehm and others. He notes that 
the spherical geometry developed by Eduard Study is a special case of the Hermi- 
tian trigonometry developed in this chapter. In particular, the trigonometric formulae 
of spherical trigonometry can easily be deduced from the Hermitian trigonometric 
formulae. The essay also contains remarks on trigonometry in general symmetric 
spaces, a subject where almost all the basic questions are still open. 

In Chapter 18, Victor Pambuccian presents a result on triangles of fixed area in a 
geometry he describes as the “Bachmann non-elliptic metric plane in which every pair 
of points has a midpoint.” This geometry is defined in an axiomatic way in terms of 

30 W. Blaschke and H. Terheggen, ““Trigonometria Hermitiana,” Rend. Sem. Mat. Univ. Roma Ser. 4 
(1939), 153-161. 


3! U. Brehm, “The shape invariant of triangles and trigonometry in two point homogeneous spaces,” 
Geom. Dedicata 33 (1990), 59-76. 
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groups, involutions in these groups and axioms concerning the relations among these 
involutions. The expression “elliptic metric plane” refers to a geometry in which there 
are three line reflections whose composition is the identity. The result presented says 
that for any given pair of points A and C and for any point B varying in the plane in 
such a way that the area of the triangle ABC is constant, the locus of the midpoints 
of AB and CB consists of two lines symmetric with respect to AC. This result is 
in the trend of a theorem attributed to Lexell (with various proofs by Lexell, Euler, 
Legendre, Steiner, Lebesgue, and several others)?” concerning the locus of the third 
vertex of a spherical triangle with two given vertices and fixed area. 


This set of essays should give the reader a taste of the fundamental notions of 
non-Euclidean geometry and an idea of a variety of problems that are studied in this 
very rich branch of mathematics. 


Vincent Alberge (New York), 
Athanase Papadopoulos (Strasbourg and Beijing), 
December 2018 


32 A. J. Lexell, “Solutio problematis geometrici ex doctrina sphaericorum,” Acta Academiae scien- 
tarum imperialis petropolitanae, Pars 1 (1781), 112-126; L. Euler, “Variae speculationes super area trian- 
gulorum sphaericorum,” Nova acta Academiae scientarum imperialis petropolitanae 10 (1792), 47-62, in 
Opera omnia, Series 1, Vol. XXIX, 253-266; A.-M. Legendre, Eléments de géométrie (11" ed.). Firmin 
Didot, Paris 1817; J. Steiner, “Sur le maximum et le minimum des figures dans le plan, sur la sphére 
et dans l’espace général,” J. Math. Pures Appl. 6 (1841), 105-170; H. Lebesgue, “Démonstration du 
théoréme de Lexell,” Nouvelles annales de mathématiques 14 (1855), 24-26. 


Prologue 
An excerpt from 
“Sur les hypotheses fondamentales de la géométrie”’ * 


Henri Poincaré 


We already know three two-dimensional geometries: 


1. Euclidean geometry, where the angle sum in a triangle is equal to two right 
angles; 


2. Riemann’s geometry, where this sum is greater than two right angles; 


3. Lobachevsky’s geometry, where it is smaller than two right angles. 


These three geometries rely on the same fundamental hypotheses, if we except 
Euclid’s postulatum which the first one admits and the two others reject. Furthermore, 
the principle according to which two points completely determine a line entails an 
exception in Riemann’s geometry and entails no exception in the other two cases. 

When we restrict ourselves to two dimensions, Riemann’s geometry allows a very 
simple interpretation; it does not differ, as is well known, from spherical geometry, 
provided we decide to give the name lines to the great circles of the sphere. 

I shall start by generalizing this interpretation in such a way that it can be extended 
to Lobachevsky’s geometry. 

Consider an arbitrary order-two surface.! We agree to call lines the diametral 
plane? sections of this surface, and circles the non-diametral plane sections. 

* H. Poincaré “Sur les hypothéses fondamentales de la géométrie,” Bull. Soc. Math. France 15 (1887) 
203-216. Translation by A. Papadopoulos. 

'[Translator’s note] A degree-two algebraic surface, also called a quadric. 

2 [Translator’s note] For a given quadric, the locus of midpoints of the system of chords that have 
a fixed direction is a plane, called a diametral plane of the quadric. In the case where the quadric has 
a center of symmetry, a diametral plane is a plane passing through this center of symmetry. There are 
more general points of symmetry for a quadric. In analogy with the case of the great circles and the small 


circles of a sphere, Poincaré calls a Jine the intersection of a quadric surface with a diametral plane, and 
a circle the intersection of a quadric with an arbitrary plane. 
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It remains to define what we mean by the angle made by two lines which intersect 
and by the length of a line segment. 

Through a point on the surface, let two diametral plane sections pass (which we 
decided to call lines). Let us then consider the tangents to these two plane sections 
and the two rectilinear generatrices of the surface which pass through the given point. 
These four lines (in the usual sense of the word) have a certain anharmonic ratio.? 
The angle that we want to define will then be the logarithm of the anharmonic ratio 
if the two generatrices are real, that is, if the surface is a one-sheeted hyperboloid. In 
the other case, our angle will be the same logarithm divided by V/—1. 

Let us consider an are of conic which is part of a diametral plane section (this 
is what we agreed to call a line segment). The two endpoints of the arc and the two 
points at infinity of the conic have, as any system of four points situated on a conic, 
a certain anharmonic ratio. Then we agree to call /ength of the given segment the 
logarithm of this ratio if the conic is a hyperbola and this same logarithm divided by 
PY | if the conic is an ellipse. 

There will be, between the angles and the lengths thus defined, a certain number 
of relations, and these will constitute a collection of theorems which are analogous 
to those of plane geometry. 

This collection of theorems can then take the name quadratic geometry, since 
our starting point was the consideration of a quadric or a second order fundamental 
surface. 

There are several quadratic geometries, because there are several kinds of second 
order surfaces. 

If the fundamental surface is an ellipsoid, then the quadratic geometry does not 
differ from Riemann’s geometry.+ 

If the fundamental surface is a two-sheeted hyperboloid, then the quadratic ge- 
ometry does not differ from that of Lobachevsky. 

If this surface is an elliptic paraboloid, then the quadratic geometry is reduced to 
that of Euclid; this is a limiting case of the preceding two. 

It is clear that we have not exhausted the list of quadratic geometries; because we 
have not considered neither the one-sheeted hyperboloid nor its numerous degenera- 
cies. 

Thus, we can say that there are three main quadratic geometries, which correspond 
to the three types of second-order surfaces with center. 

On the other hand, we have to add to them the geometries that correspond to the 
limiting cases and Euclidean geometry will be part of them. 


3 [Translator’s note] The term anharmonic ratio means the cross ratio. 
4 [Translator’s note] In the 19" century, the term Riemann’s geometry was commonly used for the 
geometry of the sphere. 
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Why did the geometry of the one-sheeted hyperboloid, up to now, escape the 
theorists? This is because it implies the following propositions. 


1. The distance between two points situated on the same rectilinear generator of 
the fundamental surface is zero. 


2. There are two kinds of lines, the first one corresponding to the elliptic diametral 
sections and the other one to the hyperbolic diametral sections. It is impossible, 
using any real motion, to make a line of the first kind coincide with a line of 
the second. 


3. It is impossible to make a line coincide with itself by a real rotation around one 
of its points, as it happens in Euclidean geometry when we rotate a line by 180° 
around one of its points. 


All the geometers have implicitly assumed that these three propositions are false, and 
indeed these three propositions are so much contrary to the habits of our spirit that 
its was not possible to the founders of our geometries to make such a hypothesis and 
think of stating it. 
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1 Introduction 


We discuss various appearances of the notion of area of triangles in non-Euclidean 
geometry (spherical and hyperbolic). We start by recalling the theorem of Girard 
which gives the area of a spherical triangle in terms of its angle sum, and its classical 
proof based on the notion of lune. We give an analogous proof for the corresponding 
theorem in hyperbolic geometry, after introducing an object which replaces lunes in 
this setting, namely, a triangle having two vertices at infinity. We then recall a for- 
mula for Euler for the area of a spherical triangle in terms of its side lengths, and 
its analogue for a hyperbolic triangle. The two formulae are used in order to give a 
precise formula for the distance between the midpoints of two sides of a spherical 
(respectively hyperbolic) triangle, in terms of the third side. These formulae are pre- 
cise quantitative versions of the positivity (respectively negativity) of the curvature 
in the sense of Busemann, in the cases of the sphere and of the hyperbolic plane. 

At several places in this chapter, we make relations between classical theorems 
and the work of Herbert Busemann. An edition of his Collected works was published 
recently by Springer Verlag, see [9]. 
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2 Angle excess and Girard’s theorem in spherical 
geometry 


Let us start by recalling the notion of lune in spherical geometry. 

Two great circles on the sphere intersect at two antipodal points. The complement 
of these two great circles has four components, see Figure 1. A June is one such com- 
ponent. In other words, a lune is a portion of the sphere bounded by two semicircles 
(that is, half of great circles). Each lune has two vertices, and the angles at these two 
vertices are equal. We recall that on the sphere, an angle made by two arcs of great 
circles meeting at a point is the dihedral angle made by the two planes in 3-space con- 
taining the two great circles. Two triangles are called colunar if they share a common 
edge and if their union forms a lune (Figure 2). This notion was known and exten- 
sively used by Menelaus of Alexandria (1°'-2"¢ c. BC) in his Spherics, a major work 
(probably the most important one ever written) on spherical geometry. An edition of 
the Spherics, with and English translation, appeared recently [28]. 


D 


Figure 1. The complement of two great circles consists in four lunes 


Figure 2. Two colunar triangles 
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Given a spherical triangle with vertices A, B, C, its angle excess is the quantity 
A+ B+C-—wa. (We denote by the same letter a vertex and the angle made by 
the two sides meeting at that vertex.) We give a short proof of the fact that the angle 
excess of every spherical triangle is positive. This result is classical: it was known to 
Menelaus; it is Proposition 12 of his Spherics [28]. Menelaus’ proof of this theorem 
is also reproduced in [24]. In the case of a hyperbolic triangle ABC, one defines 
the angle deficit as the quantity m — (A + B + C). In the next theorem, we give a 
different proof of Menelaus’ result and we prove at the same time that in hyperbolic 
geometry, the angle deficit of an arbitrary triangle is positive. 


Theorem 2.1. In spherical (respectively hyperbolic) geometry, the angle excess 
(respectively deficit) of an arbitrary triangle is positive. 


Proof. The proof in both cases is by contradiction. 

Consider first the case of spherical geometry. We know that there exists a triangle 
whose angle excess is positive (take for instance a triangle whose three angles are 
right angles). Assume that there exists a triangle whose angle excess is non-positive. 
Then, since the angle excess function on triangles is continuous, moving continuously 
the three vertices of the first triangle to the vertices of the second one, we find a 
triangle whose angle excess is zero. But with such a triangle we can tile the whole 
plane, and it is easy to see that a plane tiled by triangles whose angle sum is equal 
to two right angles is Euclidean. This is a contradiction. We conclude that the angle 
excess of an arbitrary triangle is positive. 

In hyperbolic geometry, we start with a triangle whose angle deficit is positive. 
For this, we may either consider an ideal triangle, whose three angles are zero, or we 
may take a triangle whose three vertices are close to infinity, that is, they are close to 
the vertices of an ideal triangle. The rest of the proof is analogous. 


We now turn to Girard’s theorem and its proof. 

We let K be one eighth of the total area of the sphere. In other words, K is the 
area of a triangle of angles +, 4, 4. We consider areas of lunes. Two lunes having 
equal angles have the same area, since they are isometric. From this, it follows easily 
that the area of a lune of angle A is equal to 4X A. 


We have the following: 


Theorem 2.2 (Girard’s theorem). For any spherical triangle ABC, we have 


2K 
Area(ABC) = —(A+B+4+C—-nz). 
1 
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Proof. The proof is based on computation of areas of lunes. We consider the figure 
formed by the union of the three great circles that contain the edges of the triangle 


ABC, see Figure 3. We have 
Area(ABDCA) = —A, 
IN 
4K 


Area(BCEAB) = —B, 
1 
4K 

Area(CAFBC) = —C. 
1 


Adding both sides of these three equations, we get 
Area(ABDCA) + Area(BCEAB) + Area(CAFBC) 
= Area(AFDEA) + Area(BCEAB) + Area(CAFBC) 
= Area(ABC) + Area(EFD) + 4K 
= 2 Area(ABC) + 4X, 


which gives 
4K 4K 4K 
2 Area(ABC) + 4K = —A+-—B+——C 
1 us a 


where the term 4K on the left-hand side represents the area of the heimisphere. 


Figure 3 
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Notes. 1.— We used the fact that the area of a lune is proportional to its angle. Strictly 
speaking, this needs a proof, similar to the one that Euclid gave for Proposition | of 
Book VI of the Elements saying that triangles and parallelograms having the same 
base and the same hight have equal areas. A similar remark holds for the hyperbolic 
theorem (Theorem 3.2 below). 


2.— Theorem 2.2 is attributed to Albert Girard (1595-1632), a French mathe- 
matician from Lorraine, who studied and worked in the Netherlands [30]. He stated 
this theorem in his Invention nouvelle en l’algébre (New invention in algebra) [16] 
(1629), in an appendix called “De la mesure de la superficie des triangles et poly- 
gones sphériques, nouvellement inventée” (On the measure and surface of spherical 
triangles and polygons, recently discovered). 

A proof of the same theorem is attributed to Thomas Harriot (1560-1621), but it 
seems that no written proof by him survives, and therefore this remains hypothetical. 
We refer the reader to the discussion of this fact in the work [2], p. 28ff. 


Lagrange, in his memoir “Solution de quelques problémes relatifs aux triangles 
sphériques avec une analyse complete de ces triangles” (Solution of some problems 
relative to spherical triangles with a complete analysis of these triangles) [17], con- 
siders that the proof that Girard gave is not satisfying and he writes that the theorem 
should rather be attributed to Bonaventura Francesco Cavalieri (1598-1647), who 
gave a complete proof of it in his Directorium generale uranometricum (A general 
guide for celestial measures), Bologna, 1632. Lagrange also refers to Cavalieri’s 
“beautiful proof reported on by Wallis” (1616-1703). This proof is contained in Vol. 
II of Wallis’ Collected works [32], p. 875; cf. Figure 4. 


Figure 4. From Wallis’ proof of Girard’s Theorem ([32], Vol. 2, p. 875) 
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Euler, in his memoir [11], gives a proof of Theorem 2.2 using the calculus of 
variations. In his memoir [12], he gives a proof using lunes. Figure 5 is extracted 
from Euler’s paper [12]. In the picture in the middle, the triangle A BC is represented 
with its three colunar triangles. The proof uses the fact that the three points a, b,c, 
which are the antipodals of the vertices of A, B, C of the initial triangle, are at equal 
mutual distances than the points A, B, C (Euler writes: “even though in the figure 
they seem much different’). Euler’s proof then consists in computing the areas of 
the eight triangles in the figure, ABC, ADC, Bc A, CaB,abC, bc A, caB, abc and 
adding them. The sum is the total area of the sphere, and grouping terms gives the 
result. We shall mention below another result of Euler on spherical area, contained 
in his memoir [13]. The interested reader is referred to the book [1] for an English 
translation and a commentary of several memoirs of Euler on spherical trigonometry. 


Fi Z aul —____ 
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Figure 5. From Euler’s paper [12] 


Prouhet, in his paper [26] writes that “one must add that Girard does not claim 
that his proof is rigorous, but only as a provisional probability, promising to give a 
rigorous proof.” He adds: “It is evident that Euler did not read Girard’s work. The 
proof that is attributed to him is due to Cavalieri,” and he gives the reference [10] 
Prouhet, in the same paper, describes the content of Cavalieri’s book, and he ends 
his review with the following biographical note: “Cavalieri (B.) was born in Milan 
in 1598. He entered the Jesuates order and dedicated himself to mathematics. He 
became the devoted student of Galileo. He was distracted from his studies by violent 
gout attacks and he died on December 3, 1667” (the correct year should be 1647). 

3.— Cavalieri’s name, mentioned in the preceding note, is tightly related to area 
and volume in Euclidean space, since he is the author of the so-called Cavalieri prin- 
ciple which says that two bodies in Euclidean space have the same volume if the 
areas of their sections by planes which are parallel to a given plane are equal. For 
obvious reasons, his result is considered as a precursor of integral calculus. Note 
that Cavalieri obtained this result before the invention of the differential calculus of 
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Newton and Leibniz. We also mention a famous related problem from the modern 
period, relating volume and areas of sections (and, more generally, n-volume and 
(n — 1)-volume in R”), called the Busemann—Petty problem (this is Problem | in the 
problem list [7]; Petty was a student of Busemann). The problem asks the following. 
Let K, and K2 be two convex bodies in IR” that are symmetric with respect to the 
origin and such that 


Voln—1(K1 9 A) < Voln-1(K2N A) 


for any hyperplane H passing through the origin. Is it true that Vol, (K1) < Vol, (K2)? 

The problem resisted several years before a complete solution was obtained in 
1999, but there are several variants and generalizations of this problem which con- 
stitute a living research topic. 


3 The hyperbolic analogue 


We give a proof of a hyperbolic analogue of Girard’s formula. The proof is the one 
contained in Thurston’s book [31], p. 83ff. and it is close in spirit to a proof of Gauss 
of the same theorem, which we recall after Thurston’s proof. 

We start with a hyperbolic triangle S(@) with angles a,0,0. Thus, two of its 
vertices are at infinity; cf. Figure 6. The isometry type of such a triangle is determined 
by the angle a. Let A(q@) be the area of S(a). 


Figure 6 
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The following lemma is used in the proof of the main result (Theorem 3.2) and is 
also a special case of it. 


Lemma 3.1. We have A(a) = fix — a), where P is the area of an ideal triangle. 


Proof. From Figure 7, we have 
A(a + B) = A(a) + A(B) — AO), 


and A(0) = P, that is, the area of an ideal triangle. (We recall that all ideal triangles 
are congruent, since the group of motions of the hyperbolic plane acts transitively on 
triples of distinct points at infinity; therefore, any two such triangles have the same 
area). 


Figure 7 
Thus, we have 


P—A(a+ B) = P—A(a)+ P— A(B). 


Therefore the function 
at> P— A(a) 
is linear in an appropriate sense. ! 


' The function is not defined on a vector space, but the meaning of the word linear used here is clear. 
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Thus, we can set P — A(a) = ka where k is some constant. 

For a = 7, the triangle S(@) is degenerate, and its area is zero. Thus, we have 
A(z) = 0. 

From kx = P — A(z), we get P = kz. 

This gives A(a) = k(a —a@) = f(a — a) for alla. 


From this, we can easily deduce the following well-known formula for the area 
of a hyperbolic triangle: 


Theorem 3.2. We have 

P 

Area(ABC) = —(a1# —-A-—B-C) 

a 
where P is, as above, the area of an ideal triangle. 
Proof. Consider an ideal quadrilateral Q (a quadrilateral with four vertices at infin- 
ity). The two diagonals divide Q into four triangles having a common vertex and 
each of which has its two other vertices at infinity. 

We call June the union of two sectors sharing two vertical angles (Figure 8). These 


two sectors are isometric by the central symmetry whose centre is the intersection of 
the two diagonals. 


Figure 8 
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The area of a lune depends only on its central angle. By Lemma 3.1, the area of a 
lune with central angle w is equal to 2k(a —a) with k = P/z, where P is, as above, 
the area of an ideal triangle. 

Consider now a hyperbolic triangle ABC. We draw the three lunes whose central 
angles are the three angles of the triangle. The union of these three lunes is an ideal 
hexagon H (see Figure 9). 


Figure 9 


The area of the ideal hexagon His 4 times the area of an ideal triangle, that is, 
Area(H) = 4kz. 


The sum of the areas of the three lunes is equal to the sum of the area of the 
hexagon H with two times the area of the triangle ABC. Thus, we have 


2k(a — A) + 2k(a — B) + 2k(a —C) = 4k 4+ 2Area(ABC). 


Grouping terms, we get 


Area(ABC) = k(n —-A—B-C). 
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Historical notes. To the best of our knowledge, the first proof of Theorem 3.2 is due 
to the Alsatian mathematician Johann Heinrich Lambert (1728-1777). It is contained 
in his Theorie der Parallellinien (Theory of parallels), written in 1766. This memoir 
is now available in French translation; see [22], an edition of this memoir together 
with a historical and a mathematical commentary. See also [23] and the review [27]; 
they both give an idea of the work of Lambert. In his memoir, Lambert developed the 
bases of hyperbolic geometry, without being sure that such a geometry exists. In fact, 
Lambert, like all his predecessors, thought that such a geometry cannot exist. His aim 
in developing a geometry where all the Euclidean postulates hold except the parallel 
postulate, and where this postulate fails, was to find a contradiction. In doing so, he 
proved several theorems of hyperbolic geometry, and unlike his predecessors, none 
of the counter-intuitive statements that he found concerning that geometry seemed to 
him to be a contradiction. Lambert’s poof of Theorem 3.2 is contained in §81 of his 
memoir. We recall it here. It is amazingly simple. Lambert makes the hypothesis that 
there exists a triangle whose angle sum is less than two right angles. (He also shows 
that this is equivalent to the fact that any triangle satisfies this property.) He then 
shows that the angle excess function, that is, the function that assigns to a triangle 
with angles A, B,C the value m — (A + B + C), is invariant under subdivision. 
To see this, consider a triangle ABC divided into two smaller triangles with angles 
A, E, F and B,C, D as in Figure 10. The angle deficits of the two smaller triangles 
are zm —(A+ E+ F)anda—(B+C + D) respectively. Adding the two quantities, 
and using the fact that D+ EF = 2,wefind27 —-(A+E+F+4+B+C+4+D)= 
2n —(A+ B+C + F), which is the angle deficit of the triangle ABC. Lambert 
concludes that the angle deficit of a hyperbolic triangle is an additive magnitude; 
therefore it is proportional to area. In fact, instead of dividing a triangle into two 
triangles as we did, Lambert reasons on a triangle ABC divided into three triangles 
by three lines EF’, FG, GE, as in Figure 11. The proof is similar. He continues the 
discussion of area in $82 and 83 of his memoir, noting that a similar formula for 
area, that is, Girard’s formula, is valid in spherical geometry, with the angle deficit 
of triangles replaced by angle excess. He also discusses areas of polygons. 


After Lambert, we turn to Gauss. His notes and correspondence on non-Euclidean 
geometry (spherical and hyperbolic) are contained in Volume vu of his Collected 
works [15]. In a letter he wrote to Wolfgang Bolyai on March 6, 1832, Gauss gives 
the formula for the area of a hyperbolic triangle in terms of its angle deficit. See [15], 
Vol. VIL, p. 221. We recall the proof. It uses the notion of parallelism in hyperbolic 
geometry. 


14 N. A’Campo and A. Papadopoulos 


Figure 10. A triangle divided into two triangles 


Cc 


A G B 


Figure 11. Lambert’s proof of the area as angle deficit: additivity of angle deficit 


In the hyperbolic plane, if L is a line and x a point not on L, a line parallel to 
L through x is a line that does not intersect L and that is the limit of two families 
of lines: one family of lines that intersect L and a second family of lines that do not 
intersect L. (See Figure 12.) This definition was used by the three founders of hyper- 
bolic geometry. Lobachevsky, in his Geometrische Untersuchungen zur Theorie der 
Parallellinien (Geometric investigations on the theory of parallel lines) [18], studied 
in detail this relation of parallelism. The trigonometric formulae that he obtained 
involve this function. 
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Figure 12. The two shaded lines are the parallels through x to the line L 


Given a line L and a point P not on L, consider the two parallels to L from P and 
let @ the angle they make. Gauss knew that the area A(#) bounded by L and these 
two parallels is finite (Figure 13). He also knew that the area of an ideal triangle is a 
constant (that is, this area does not depend on the choice of an ideal triangle). Let C 


be this value. 


$ 


Figure 13. The area bounded by the given line and the two parallel rays which make an angle ¢ is 
denoted by A(@) 


Consider the function 


f(b) = A(x — ¢). 
From Figure 14, we have, for any angle ¢ in (0, 7), 


f(a —$) + Ff) =C. 
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o¢/1-¢ 


Figure 14 


Consider now a division of the total angle at some point into three angles and let 
their values be m — ¢, a — y and ¢ + w (Figure 15). We have 


SG) + fW+ fr-—o-W=C 
This gives 
S@) + fW=C-fa-d-W=fEGt+y). 


Nee 
o+v 
my 


Figure 15 


From this, Gauss deduces that the function f is linear, that is, it has the form 


fo) = 46 


where A is a constant. 
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Now given an arbitrary triangle A, we produce three of its sides, as indicated in 
Figure 16. 


Figure 16 


We obtain 


C= f(a) + f(B) + f(y) + Area(A), 


which gives 


Area(A) = A(xn —a— B-y). 


This completes the proof. 


Gauss’s proof is contained in the article [29] by Staeckel and Engel which repro- 
duces the correspondence between Gauss and the two Bolyai. 


Lobachevsky, in his Pangeometry gives another proof of the formula for the area 
of a hyperbolic triangle as angle deficit. See [21], p.48. In the same treatise (p. 65), 
he gives a formula for the area of a hyperbolic triangle in terms of its side lengths, 
more precisely, in terms of the so-called angle of parallelism function of these sides. 
As a matter of fact, the computation of areas and volumes of non-Euclidean objects 
constitute a substantial part of Lobachevsky’s treatise [21] and of some other works, 
e.g. his Application of imaginary geometry to certain integrals (1836) [19] and his 
“Géométrie imaginaire” (Imaginary geometry) (1837) [20]. 
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4 Area and side length 


In this section, ABC is a triangle (which may be spherical or hyperbolic) with side 
lengths a,b, c opposite to the vertices A, B, C, and A is its area. We shall consider 
that the sphere has constant curvature +1 and that the hyperbolic plane has constant 
curvature —1. In particular, the area formula (Theorems 2.2 and 3.2) become respec- 
tively 

A=A+B+C-az 


and 
A=n-(A+B4+C). 


In his memoir [13], Euler gave the following formula expressing the area of ABC 
in terms of its side lengths. 


Theorem 4.1 (Euler’s formula for the area of a spherical triangle [13]). The following 
identity holds: 


-A 


1 1 + cos(a) + cos(b) + cos(c) 
on($a) = 


4cos (4a) cos (4d) cos (4c) ; 


Euler gave a first proof of this formula using a computation, and another proof us- 
ing the calculus of variations. We shall also use the following analogue of 
Euler’s formula for the case of a hyperbolic triangle, a proof based on the calculus of 
variations, given in the paper [14] in the present volume: 


Theorem 4.2 (Euler’s formula for area, hyperbolic analogue). The following identity 


holds: 
1 1 + cosh(a) + cosh(b) + cosh(c) 
cos ( ) = 


-A ) = -— So. 
Z 4 cosh (4a) cosh (35) cosh (4c) 
The following result, in the spherical case, is contained in [25] p. 115. 


Theorem 4.3. In a triangle ABC of area A, let D and E be the midpoints of AB 
and BC (Figure 17). We have, in the spherical case, 


1 1 
cos(DE) = cos (=4) cos (=ACc), 
2 2 
and in the hyperbolic case, 


cosh(DE) = cos (54) cosh (54c), 
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Figure 17. D and E are the midpoints of AB and AC 


Proof. We provide the proof in the spherical case. In the hyperbolic case, the proof 
is analogous. The cosine law in the triangles ABC and ADE gives 


cos(a) — cos(b) cos(c) 


a sin(b) sin(c) oa 
and 
__ cos(DE) — cos (3b) COS (4c) 
si sin (30) si (4c) 2) 


Equating the two values of cos A, we obtain 


cos(DE) = ((cos(a) —cos(b) cos(c)) ( sin (5%) sin (5¢)) 


1 
sin(b) sin(c) 
+ cos (50) cos (5°) sin(b) sin(c)). 


Replacing in the right-hand side cos(b) and cos(c) by their values in terms of sin (45), 


sin (4c), cos (30), and cos (4c), and after some cancellations, we get 


cos(a) + 2cos? ($b) + 2cos? (4c) — 1 
4cos (3d) COS (4c) 
4 cos (4d) cos ($c) 
4cos (3d) Cos (4c) 
_ 1+ cos(a) + cos(b) + cos(c) (<<) 


~ 4cos (3d) cos (4c) cos (3a) 


cos(DE) = 
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Using Euler’s formula (Theorem 4.1), we obtain 


1 1 
cos(DE) = cos (54) cos (54). 


which is the required formula. 


Corollary 4.4. Consider a family of spherical or hyperbolic triangles ABC having 
fixed area and a fixed side length, say, the length of AC is fixed. Then, the distance 
between the midpoints of the sides AB and BC is also constant, for all triangles in 
the family. 


This result is attributed by Papelier ([25], p. 115), in the spherical case, to Guder- 
mann. 


Corollary 4.5. Using the notation of Theorem 4.5, we have, in the spherical case, 
1 
DE > 74c . (3) 


and in the hyperbolic case, 


1 


Notes. 1.— The result of inequation (3) in Corollary 4.5 concerning the sphere was 
known to Menelaus; this is Proposition 27 of his Spherics [28]. 

2.— The cosine laws (formulae (1) and (2)) that we used in the proof of The- 
orem 4.3 express the value of an angle of a spherical triangle in terms of the three 
sides. There is another “cosine law” in spherical geometry which expresses a side in 
terms of the three angles. The two formulae are the same up to changing capital and 
small letters. This is due to polarity theory in spherical geometry. More precisely, 
to any spherical triangle ABC, there is an associated triangle A’ B’C’ whose angles 
satisfy A! = a — BC, B’ = x — AC and C’ = x — AB and whose sides satisfy 
A=a-—B’'C’,B=n-—A'C’ andC = 2 — A’B’,cf. e.g. [25] p.3. One then uses 
the fact that the function cos is even. 

3.— Motivated by the results expressed in Corollary 4.5, Busemann defined a no- 
tion of bounded and zero curvature in a metric space satisfying very mild properties 
(essentially, the fact that one can define midpoints); cf. [5], [6], and [8]. More pre- 
cisely, a metric space (M, d) said to be Busemann positively curved if for any three 
points A, B, and C in M if D is a midpoint of A and B and EF a midpoint of B 
and C, we have 


d(D, E) > 5a(A,C), (5) 
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There are variations of this definition, namely, the space is called Busemann non- 
negatively curved, negatively curved, or non-positively curved depending on whether 
the strict inequality in (5) is replaced by a large inequality, or the reverse strict or large 
inequality respectively. If the inequality is replaced by an equality, the space is said 
to be zero curved. All these notions were extensively used in metric geometry after 
Busemann. 


4.— Theorems 4.1, 4.2, and Heron’s formula in Euclidean geometry show that 
in the three classical geometries the areas of triangles can be expressed using side 
lengths. A result of Busemann shows that these are the only geometries where this 
is possible. Indeed, he proves in [4] that a two-dimensional metric space with very 
mild properties is either locally isometric to the Euclidean, spherical and hyperbolic 
plane if and only if, locally, an area functions exists for which triangles with equal 
sides have equal area. 
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1 Introduction 


The aim of this essay is to prove some analogues in hyperbolic geometry of some of 
Euler’s results in spherical geometry. We base our considerations on two memoirs of 
Euler. In the first memoir [2], Euler gives the proofs of a complete set of spherical 
trigonometric formulae for right triangles using a variational method. The advantage 
of his method is that it does not use the ambient Euclidean 3-space. Therefore, the 
method is in some sense intrinsic and may be used with some modifications in the 
setting of hyperbolic geometry. 

In order to obtain the trigonometric formulae, Euler works in the so-called equidis- 
tant coordinates on the sphere and derives the length element ds in these coordinates. 
He uses at many places the fact that spherical geometry is infinitesimally Euclidean, 
i.e. that the Euclidean relations are satisfied at the level of differentials. 

We work (Theorem 1.1) in the setting of Lobachevsky’s geometry as introduced 
in [4]. There are some intersections between Lobachevsky’s results and Euler’s re- 
sults. Both work in the same coordinates. One difference between the two approaches 
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is that Lobachevsky uses trigonometry, which he develops from first principles, in or- 
der to find the length element ds, whereas Euler uses ds and the property of spherical 
geometry of being infinitesimally Euclidean, in order to reconstruct the trigonometry. 
The second memoir of Euler, [3], deals with an area formula in spherical geometry 
in terms of side lengths and a related problem, known as Lexell’s problem. 
Let us recall Euler’s area formula and the formulation of the Lexell problem in 
the spherical case. 


Theorem 1.1 (Euler area formula). In spherical geometry, the area A of a triangle 
with side lengths a, b, c is given by 


A 1 + cos(a) + cos(b) + cos(c) 
cos ( ) = 


x 4cos (4a) cos (45) cos (4c) , 


Lexell problem. Jn spherical geometry, given two distinct points A, B, determine 
the locus of points P_ such that the area of the triangle with vertices A, B, P is equal 
to some given constant A. 


After proving the hyperbolic analogue of Euler’s area formula (Theorem 3.1), we 
revisit it again and give a solution of the Lexell problem in the hyperbolic case using 
geometric considerations. Our proof gives a new form and a geometric interpretation 
to the Euler area formula. 

The results and methods introduced in this chapter should be interesting if gener- 
alized to the 3-dimensional case, whether spherical or hyperbolic. 

We assume in our proofs that all the functions we are dealing with are enough 
differentiable. 


2 The length element in Lobachevsky’s work 


Euler introduced in [2] the coordinates on the sphere, such that y = const-curves 
are a chosen great circle G and the small circles equidistant to it, x = const-curves 
are the great circles, which are orthogonal to G. Thereafter, he deduced the formula 
for the length element ds in these coordinates 


ds” = cos”(y) dx? + dy’. (1) 
Lobachevsky proved in [4] the analogue of this formula 
ds” = cosh?(y) dx” + dy”. (2) 


In this section we will explain this result and introduce the related concepts and no- 
tions, which will be also of use in the proof of Theorem 1.1. 
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2.1 The theory of parallels and the angle of parallelism 


In the Pangeometry Lobachevsky presents a summary of his lifelong work on hyper- 
bolic geometry. He starts with first principles and develops from there the theory of 
parallel lines. He uses first the characteristic property of hyperbolic geometry: the 
sum of the angles in a right triangle is strictly less than two right angles. Based on 
this fact he defines a function IT, called the angle of parallelism as follows. 

First, we recall the notion of parallel lines in the sense of Lobachevsky. Consider 
a line / anda point P not belonging to /. There is the unique perpendicular from P to 
the line /. We denote it by PP’ and its length by p. Let £ be the set of lines passing 
through P. Each line from £ can be parametrized by the angle 6 € [0,2[, which 
this line makes with the perpendicular P P’ on a given side of a line containing PP’. 
Then the parallel line to | through P (on a given side of PP’) is the line l’ = Ig 
characterized by the property that for 9 < * the lines /g intersect / and for 0 > 0%, 
the lines /g and / do not intersect. Therefore, a parallel line /’ to / is the limiting 
position of lines through P, which intersect / on a given side of P’. The angle 6* is 
called the angle of parallelism (see Figure 1). 


P 
Q* 
Pp l/ 
P’ l 


Figure 1. Angle of parallelism 6* = II(p) 


Definition 2.1. Let / be a line and P a point not on /. Let P’ be the foot of the 
perpendicular dropped from P to / and p the length of PP’. A parallel line to / 
through P makes with the perpendicular PP’ two angles, one of which is acute. 
This angle is called the angle of parallelism, and it is denoted by II(p). 


This notation, used by Lobachevsky, emphasizes the dependence of the angle of 
parallelism on the length p of the perpendicular P P’, which can be seen as the angle 
of parallelism function. 

The first analytic formula in [4] clarifies the relation I1(p) (see [4], p. 25): 


cos(II(p)) = tanh(p). (3) 
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2.2 Equidistant coordinates 


We introduce coordinates in the hyperbolic plane as follows. Let O be a point and 
OX and OY two orthogonal lines, which intersect at O. We choose a side on each 
line with respect to O, which we call positive. These lines will become x- and y-axes 
of the coordinate system (see Figure 2). Let M be a point in the hyperbolic plane. 
We denote by P,, the foot of the perpendicular from M to OX. In analogy with the 
usual Cartesian coordinates in the Euclidean plane, we associate to M the pair of real 
numbers (x, y), given by 


x = +Length(OP,), (4) 
y = +Length(P,M). (5) 


The signs of x and y are chosen as follows. We take x (or y) positive, if M lies in 
the same halfplane bounded by the line OY (or OX) as the positive side of OX (or 
the positive side of OY). Likewise, we take x (or y) negative, if M and the positive 
side of OX (or the positive side of OY) lie in different halfplanes with boundary line 
OY (or OX). 


Figure 2. Equidistant coordinates (x, y) 


Remark 2.2. In these coordinates, the (vy = const)-lines are the line OX and the 
equidistant curves to OX, and the (x = const)-lines are the lines which are orthogo- 
nal to OX. Interesting facts on these coordinates systems in Euclidean, spherical and 
hyperbolic geometries can be found in [7]. 


2.3 The length element ds in coordinates 


We introduce ds in equidistant coordinates (x, y), a formula due to Lobachevsky 
(see [4], p. 43) 
a 


2: 
mG) so) 
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From (3), 


1 
sin(II(y)) = cosh(y). 


Therefore, we can rewrite (6) as (2). 


3 Trigonometry in right triangles 


In this section we will see how to reconstruct trigonometry from ds as in (2) and 
from the assumption that hyperbolic geometry is at the infinitesimal level Euclidean. 
Practically, we assume in the proof of the following result that the hyperbolic relations 
limit to the Euclidean ones. 


Theorem 3.1. In the pv: plane, let ABC be a right triangle with the sides a, 
b, c and angles a, Pee as indicated in Figure 3. Then the following trigonometric 
identities hold: 


: __ tanh(a) _ u __ cos(@) sinh(a) 
sinh(b) = =r cosh(b) = ina) cosh)’ tanh(b) = a 
sinh(c) = ae cosh(c) = saa) tanh(c) = ne, 

cos(a) soci 
m= cosh(a) i a er ean) 


where u = y/cosh?(a) — cos?(q). 


Figure 3. Right hyperbolic triangle 
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Proof. We work in the equidistant coordinates (x, y). We take A as the origin O of 
the coordinate system and the line containing AC as OX. The positive side of OX 
(with respect to O) is a side containing the point C. We choose the positive side of 
OY such that the ordinate of the point B is positive. 

The line AB in coordinates (x, y) can be seen as the graph of a function 


g:[0, b] —> R, 


that is, 
AB = Graph(g) = {(x, g(x)) | x € [0, d]}. 


This provides a parametrization of AB by y: [0,b] > R?, given by x + (x, g(x)), 
and the length of AB can be computed by means of ds (compare with (2)): 


b 
Length (AB) = li ycosh?(g(x)) + (g/(x))? dx. (7) 
0 


Figure 4. Proof of the theorem 


Since g(b) = a, the function g is useful in the search for the relations between 
the sides of the triangle ABC. We used in (7) implicitly another function of this 
type, r: [0, b] — [0,c], given by r(x) = Length(AM), where M has the coordinates 
(x, g(x)). We have r(b) = c. The function under the integral sign in (7) is the 


derivative of r, 
r'(x) = ycosh*(g(x)) + (g/(x))?. (8) 


The idea of the proof is to deduce a differential equation in terms of g(x) and g’(x) 
using the calculus of variations. This equation combined with (8) will allow us to 
find r(b) and the other relations of this theorem. 
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Among all the curves connecting A and B, which can be represented in coor- 
dinates as graphs of differentiable functions, the line segment AB has the minimal 
length. Therefore, the function g(x) minimizes the length functional L, given by 


b 
L(f) = Length(cy) = ; ycosh*( f(x) + (f/(x))? dx (9) 
0 


and defined on the functional space 
Co,q([0, 5], R) = {f:[0,b] > R | f is differentiable, f(0) = 0 and f(b) = c}. 


In (9), we denote by cy a curve connecting A and B, such that it can be represented 
in coordinates as Graph(/). 
The differential of L vanishes necessarily at g and in particular we have 


d 
—L(g+c€h)|<=09 = 0 (10) 
dé 


for every differentiable function h: [0, b] > R with g + che CE, (10, 5], R). 
For the sake of brevity, we introduce a function F: R? > R, given by 


F(x1,%2) = \/ cosh? (x1) + ne 


such that the functional L becomes 


b 


L(f) = / Ff), f'(@)) de. 


0 


The directional derivative in the left-hand side of (10) after integration by parts be- 
comes 


b 

d OF d OF 

7s + ch)|s=0 = i (5, (s@). 809) _ Te ag EOS CD) dx. 
0 


This relation vanishes if and only if the term in brackets is identically zero. We obtain 
this result by taking a positive function g for h. The next relation was named later on 
Euler-Lagrange formula 


OF d OF 
Fy, (8). 80) — 5 (g(x), 8'(x)) = 0, forallx € [0,5]. (11) 
bal dx 0x2 
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We simplify further the notation by means of the functions 


Gila) = e.g"). 7 = 1.2, (12) 

The Euler-Lagrange formula (11) gives the simple relation 
Gi =G, (13) 
On the other hand, we manipulate the function r’(x), which stands under the integral 


sign in L(g) (compare with (8)) 


OF aF 
(g(x), g'(x)) g/(x) + ay 8): g'(x)) g(x), 
1 x2 


" 2 oe 
ps Ox 


or in terms of the functions G; and using further the relation for G;, given by the 
Euler-Lagrange formula (13) 


r"=G.¢'+Grg" =Gi¢g'+Gr.¢" =(Ge8'), 


we finally get 
r'=Grg'+C, 


where C is a constant with respect to the variable x. 
We rewrite this relation using the definition of Gz (see (12)) and the relation r’(x) 


(see (8)) : 
(g’) 
s 2(¢) = ——__"—_ 
(3’)° + cosh" (g) = (g’)? + cosh*(g) —_ 


Resolving this equation in g’ gives 


cosh(g(x)) V cosh? (g¢(x)) — C? 


8 (8) = oe (14) 
and (8) takes the form 
2 
r(x) = ane (15) 


Let M and M' be two points on AB with coordinates (x, y) and (x’, y’), respectively. 
We denote by P and P’ the feet of the perpendiculars dropped from M and M’. 
Let /’ be a halfline, orthogonal to MP. We denote the intersection of /’ with M’P’ 
by L’. We obtain the trirectangular quadrilateral PML’ P’ with the acute angle L’. 
We denote the length of L’ P’ by yo, the length of AM by s, which coincides with 
r(x), the length of MM’ by As, x’ — x by Ax and y’ — y by Ay (see Figure 4). 

Taking M = A, we determine the value of a constant C from a limit Euclidean 
relation for cos @ in the triangle AM’ P’ (see Figure 5). 
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B 


ss 


Figure 5. Case M = A 


In a right hyperbolic triangle holds a limit Euclidean relation: 


@ i Ax 1 
cos(a) = lim — = —~= 
a As>0 As — r’(0) 
We obtain 

C =cos(a). (16) 
We return to the general situation again, i.e. M is an arbitrary point on AB. We denote 
the angle ZAMP by 6. In the triangle MM'L’, the angle ZM'ML’ = 5 — 0. If 
Length(MM") — 0, the angle 7M’L’M tends to $ and for MM’L’ holds a limit 
Euclidean relation for right triangles 


: ! ’ : y’ = 2 
sin(Z M’ML’) = lim : 
As>o As 
The value of 9 = ZAMP depends on the position of M. We define a function 
w: [0, b] — [0, B], such that w(x) = ZAMP, 


3 y’—y2 
Oe Bs 


_ Ay 
lim — 
As—>o As 
g(x) 


FG 


_ vcosh*(g(x)) — C? 
7 cosh(g(x)) . 
Taking x = b in this relation and replacing C by (16), we get 


cosh? (a) — cos?(a) 


cosh(a) 


cos(B) = (17) 
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We get the relations for r(b) and g~!(a) by means of (14) and (15), 


b 


c=rT)= [ro dx = 


0 


cosh(g) 


cosh?(a) — “) 
J cosh?(g) — C2 


dg= acosh ( 1-2 


a 


b 
C : C tanh(a) 
b= [dx = | ——_ ag = sion (TES). 
_ 4 cosh(g) cosh?(g) —C2 ( eit a 


Replacing C by (16), we finally obtain 


/cosh?(a) — cos?(a) 


cosh(c) = rE 


: (18) 


cos(@)tanh(a) _ tanh(a) 


n= sin(a) ~~ tan(a) * 


(19) 


The remaining relations of Theorem 3.1 follow easily from (17)—(19) and from the 
properties of the functions cosh, sinh and cos. 


4 An area formula by a method of Euler 


Theorem 4.1 (hyperbolic Euler Formula). Jn the hyperbolic plane, the area A of a 
triangle with side lengths a, b, c is given by 


A 1 + cosh(a) + cosh(b) + cosh(c) 
cos (=) = — 
( 2 ) 4 cosh ($a) cosh (4b) cosh ($c) 
Beforehand, we deduce a formula for the area of the hyperbolic circular region K;,g 
with radius r and angle ¢ (see Figure 6), which we will use in the proof. 

For this, we cite the result of J.-M. de Tilly from his work on hyperbolic ge- 
ometry [6], see also [5] in this book for the description of a kinematic approach of 
de Tilly. He introduced a function circ(1), which is defined as the circumference of 
the hyperbolic circle of radius r and showed without use of trigonometry that 


circ(r) = 2m sinh(r). (20) 
Lemma 4.2. The area of the hyperbolic circular region K;,g with radius r and angle 


g is 
Area(K;,9) = |g|(cosh(r) — 1). 
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Figure 6. Region Kr, 


Proof. By (20), the circumference of the hyperbolic circle of radius x is 27 sinh(x) 
and hence, the length /(x) of the hyperbolic circular arc of the same radius and angle 
g is sinh x|g|. Then, for the area of circular segment, we have 


Area(K;,9) = |9| [ sini) dx = |g|(cosh(r) — 1). 
0 


Figure 7. Proof of Theorem 1.1 


Proof of Theorem |.1. Let AB be a segment of fixed length a. Let Z be a point in a 
chosen connected component of H?\{/}, where / is a line containing A and B. We 
fix a ray r, starting at Z and consider the area function A: [0,¢€) > Rso, given by 
A(t’) = Area(A BZ’), where Z’ is a point on r such that the length of ZZ’ is t’. 

Besides the function A, we define in the same manner the functions x, y, g and 
w, such that x(t’), y(t’), g(t’) and w(t’) are the values of the side lengths AZ’, BZ’ 
and of the angles 7Z’AB and ZA BZ’, respectively (see Figure 7). 
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We are looking for the expression for the differential dA. We consider the area 
change 
AA(t’) = Area (ABZ') — Area (ABZ). 


The area change is given by 
AA = sgn(Ag) Area(AZZ’) + sgn(Aw) Area(BZZ’) . (21) 
In order to see this, we distinguish the four cases: 
1. sgn(Ag) = sgn(Ay) = 1, 
2. sen(Ag) = sgn(Ay) = —-1, 
3. sen(Ag) = —1, sgn(Aw) = 1, 
4. sgen(Ag) = 1, sgn(Aw) = -1. 


Since AA corresponding to the cases 1 and 3 is equal to —AA in the cases 2 and 4, 
respectively, we reduce our consideration to the cases | and 3. 

In the case 1, one triangle is entirely contained in the other one and formula (21) 
can be easily seen from Figure 8. 

In the case 3 a side of one triangle intersects a side of another triangle. We denote 
the intersection point by O (see Figure 7 for the case 3). 

First, the area difference does not change, if one removes from the triangles A BZ’ 
and ABZ the common triangle AOB 


Area(ABZ’) — Area(ABZ) = Area(BOZ’') — Area(AOZ). 


Further, adding ZOZ’ to both triangles BOZ’ and AOZ does not affect the area 
difference again, therefore we finally get 


Area(A BZ’) — Area(ABZ) = Area(BZZ') — Area(AZZ’), 


which is formula (21) for the case 3. 
We consider 


AA(t’) Area(AZ Z’) Area(BZZ’) 
a : 


Rg De ea, 


t/>+0 

(22) 

Since the area of AZZ’ is enclosed by the areas of the hyperbolic circular segments 
Kx (0),9 and Kx(),g, by Lemma 4.2 we have an estimate 


| Ag|(cosh(x(0)) — 1) < Area(AZZ’) < |Ag|(cosh(x(t')) — 1), 
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from which it follows that 
Area(AZ Z’ 
lim | Ames?) = |Ag| (cosh(x(0)) — 1). (23) 
t/>+ 


For the sake of brevity, we denote x (0) by x and y(0) by y. 
Combining (23) with (22), we get the following relations in terms of the differ- 
entials dA, dg and dw 


dA = (cosh(x) — 1) dy + (cosh(y) — 1) dw. (24) 


In order to rewrite this relation in terms of the differentials of the sides dx and dy, 
we use the First Cosine Law. Applying it to ABZ’, we obtain 
; cosh(a) cosh(x (t’)) — cosh(y(t’)) 
cos(¢(t')) = ——————__ 
sinh(a) sinh(x(t’)) 


By differentiation, we get 
: (—cosh(a) + cosh(y) cosh(x)) dx — sinh(x) sinh(y) dy 
S09 TFS 
sinh(a) sinh“ (x) 


where x, y, and g denote x(0), y(0), and g(0), respectively. 
From now on and until the relation (28), we follow the computations of Euler, 
in order to find a well-suited substitution v = f(a, x, y) for the integration of dA. 


Since on 
sin(g) = V1 —cos2(9) = suh(@) anh) 
where 
w := 1/1 —cosh?(x) — cosh?(a) — cosh?(y) + 2. cosh(a) cosh(x) cosh(y), 
we have 


tip (cosh(a) — cosh(y) cosh(x)) dx + sinh(x) sinh(y) dy 
sinh(x)w : 
Analogously, we obtain 
tip (cosh(a) — cosh(x) cosh(y)) dy + sinh(x) sinh(y) dx 
sinh(y)w , 
Therefore (24) becomes 
__ (cosh(a) — cosh(y) cosh(x)) dx + sinh(x) sinh(y)dy 
7 sinh(x)w 
(cosh(a) — cosh(x) cosh(y)) dy + sinh(x) sinh(y)dx 
7 sinh(y)w 


dA (cosh(x) — 1) 


(cosh(y) — 1). 
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Now, using the trigonometric formulae 


*) _ cosh(x)—1 — _ sinh(x) 


tanh (>) = = 
a 2 sinh(x) cosh(x) + 1 


we simplify our equation 
wdA = tanh (5) (cosh(a) + cosh(y) — cosh(x) — 1) dx 
+ tanh (5) (cosh(a) + cosh(x) — cosh(y) — 1) dy. 
Taking the symmetric expression 
s := cosh(a) + cosh(x) + cosh(y), 
we get 
wdA = (— 1+ s)( tanh (5 )ax + tanh (= ) ay) 
— 2tanh (5) cosh(x) dx — 2 tanh (5) cosh(y) dy. 


Applying again the (former) expression of tanh ( x), we have 


tanh (5) cosh(x) = — tanh (5) + sinh(x), 


so that 
wdA sinh(x) dx + sinh(y) dy 
12. = tanh (5 ) dx + tanh (5) dy — a << a (25) 


Using f tanh (3x) dx = 2I1n (cosh (3x)) and 


/ sinh(x) dx + sinh(y) dy _ 7 d(1+s) 


= In(1 , 
l-+s l+s ptr s) 


we can find the antiderivative of the right-hand side of the equation (25) and we get 


d 
|< =2In a (26) 
l+s (1 +s) cosh ($) 


where g is given by the symmetric expression 


q := cosh (5) cosh (5) cosh (5). 
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Taking p ‘= 745 £ ; and differentiating the equation, we obtain 


Une 
l+s Dp? 
or equivalently, 
pf as a (27) 
Pp ow 
The term its can be expressed as a function of p. Indeed, we note that 


w? + (1 +s)? = 2(1 + cosh(a)(1 + cosh(x))(1 + cosh(y)) 


= 16cosh? (5) cosh? (5) cosh? (5) 


= 164’. 
Hence, 


= /16q?-—(1+s)? 
= /1l6p?—-1. 


and therefore 


l+s 
Then (27) takes the form 
2d, 
(j= —— 
pv 16p? —-1 
Letting now v = ree we obtain 
2d 
dA =-—S_., (28) 


V1—v? 
In terms of v, the primitive of dA can be computed, so that we get 
A=C + 2arccos(v) 


C42 (- + cosh(a) + cosh(x) + cont) ) 
— arccos ( — AAA 
4 cosh (3a) cosh (4x) cosh (3y) 


We determine the constant C by considering the degenerate Euclidean case A = 0. 
If x = a, we get y = 0, such that the form of our equation becomes 
2+ 2cosh 
0 = C + 2arccos (tec) 
4 cosh (5a) 


From 4 cosh? (4a) = 2 + 2cosh(a) follows C = 0. 
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Taking now x = b and y = c, we finally get the result 


(<A) 1 + cosh(a) + cosh(b) + cosh(c) 
cos { =A) = ASA ———_ 
2 4 cosh ($) cosh (2) cosh (£) 

Lemma 4.3. The solution of the Lexell problem is a smooth curve. 


Proof. Let P; be a chosen connected component of H\{/}, where / contains the 
points A and B. The solution of the Lexell problem is the set 


L={ZeP, | Area(ABZ) = A}. 
The set L is a smooth curve if the area function B: P; > Rso, given by 
B(Z) = Area(A BZ) 


is regular or, equivalently, for every point Z € P there is aray r starting at Z, such 
that the value of B changes in the direction of this ray. 

For a fixed Z € Pj, consider Z’ € P, in the interior of 7A’ZB’, where the 
points A’ and B’ belong to the extensions of the sides AZ and BZ respectively (see 
Figure 8). Using Pasch’s axiom of neutral geometry, one can deduce that ABZ is 
contained in ABZ’, which gives B(Z’) > B(Z). This gives that the area value 
changes along ZZ’. Therefore, the function 8 is regular. 


Figure 8. Area function A is regular 


Lemma 4.3 also follows from the fact that the area of triangles is a function that 
depends real analytically on the three vertices and that it is non-degenerate. 


The area formula for hyperbolic triangles 43 


5 The area formula revisited and Lexell’s problem 


We start this section by recalling the Girard theorem, which states that the area of 
a spherical triangle on a sphere of radius 1 is the angular excess, i.e. the (positive) 
difference between the sum of the angles of this triangle and zr. In the case of the 
hyperbolic plane, the following is the hyperbolic analogon of this theorem. 


Theorem 5.1 (hyperbolic Girard theorem). In the hyperbolic plane, the area of a 
triangle with angles a, B and y is given by the angular deficit 


A=na—-(a+B+y). 


A proof of this theorem without use of trigonometry can be found in [1], Sec- 
tion 5.3. 

In his proof of the area formula, Euler fixes the length of one side of triangle, being 
inspired by the Lexell problem, where one is looking for the configuration set of all 
triangles with the same base and the same area. The quantity v, which appears at the 
end of the proof of Theorem 3.1 has the property that it varies or remains constant at 
the same time with the area A, according to the form of (28). Recall that 


_ 1 +cosh(a) + cosh(b) + cosh(c) 


29 
4cosh ($) cosh (3) cosh (§) oe 


In order to give a geometric interpretation of the Euler formula, one is led then to 
look for quantities which will not be affected by the change of the area. 

The crucial construction here is the dissection of the hyperbolic triangle explained 
in [1], Chapter 5.5. 

Before giving this construction, let us note that we use here the term congruence 
of triangles or more generally, of plane figures Fy and F2, as a primary notion of 
hyperbolic geometry, developed from the first principles (see [1] Ch. 2 for more 
details). We will write F; = F2 for this relation. Moreover, we say that two figures 
F, and F» are scissors equivalent, if one of these figures can be cut into a finite 
number of pieces and reassembled into the other one. It is easy to show that the 
relation of scissors congruence defines an equivalence relation on plane figures and 
that the scissors equivalent figures have the same area, in whatever reasonable sense 
area is defined. (In fact, in Euclid’s Elements the notion of “having the same area” is 
defined as scissors equivalence of figures.) 

Let ABZ be a triangle in the hyperbolic plane, A’ and B’ be the midpoints of 
AZ and BZ respectively, and F, G, H be the feet of the perpendiculars dropped 
from A, Z, B respectively on the line A’B’ (see Figure 9). By construction, the 
following congruences hold: AFA’ = A’GZ and ZGB' = B’BH. From this one 
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deduces easily that the triangle ABZ is scissors equivalent to the Khayyam-Saccheri 
quadrilateral BH FA (i.e., a quadrilateral, which has two opposite edges of the same 
length and makes right angles with the third common edge ([1], Definition 3.28)). 


A B 


Figure 9. Dissection of hyperbolic triangle and the Lexell problem 


This construction is closely related to the solution of the Lexell problem. Indeed, 
consider a triangle with notation as in Figure 9. Let A denote the area of this triangle. 
Let q be an equidistant curve to the line A’ B’, which contains Z. Then every triangle 
ABZ’ such that Z’ lies on q and close to Z has the same area A. This follows 
immediately from the consideration that all such triangles are scissors equivalent to 
the same quadrilateral BH FA. By Lemma 4.3, the solution of the Lexell problem is 
a smooth curve. Thus, the solution is exactly the curve q. 

Let m denote the hyperbolic length of A’ B’. By construction, in the quadrilateral 
BH FA the length of HF is 2m, and the length of A F is the same as the length of BH 
and is equal to / and the length of AB is a. Therefore, these quantities can be related 
to v, since they remain constant if the area stays constant. A direct computation shows 
1 + cosh(a) + cosh(x) + cosh(y) 


h = 
cosh(m) 4 cosh (5) cosh (>) 


Thus the quantity v turns out to be 


_ cosh(m) 
~~ cosh ($) 


and we get immediately an alternative elegant expression for the hyperbolic Euler 
formula . ‘ 
cos ( ) _ os (m) (30) 


2 cosh ($) 
As a conclusion, we will give a geometric interpretation of (30) on the quadrilateral 


BHFA. The perpendicular bisector of AB divides the Khayyam—Saccheri quadri- 
lateral BH FA into two congruent trirectangular quadrilaterals ({1], Chapter 3.6). 
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On the one hand, by the dissection argument, the area of such a quadrilateral is equal 
to 4 and on the other hand, by the Hyperbolic Girard theorem, it is given by the angle 


defect 27 — (a + 32) = . — a. So we have 


cos (5) = COs (> - 9) = sin(@). 


Thus the formula (30) reflects the trigonometrical relation in the trirectangular quadri- 
laterals 
‘ae cosh(m) 
sin(a) = ———_~, 
cosh ($) 
which can be found in [1], Chapter 6.6. The proof is a trigonometric computation 
which uses the trigonometric formulas on triangles introduced in this chapter. 
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1 Introduction 


This essay is motivated by the paper “Problematis cuiusdam sphaerici solutio” 
(Solution of a problem on the sphere) [8] of F. T. von Schubert,! where he solves 
an extremal problem in spherical geometry, namely, for a triangle of given base and 
fixed altitude, he proves that the minimal area is attained at the point whose the foot 
of the corresponding altitude is the midpoint of the base, and the maximal area is 
attained at the point which gives the minimal area for the antipodal given base. See 
Figure | for more details. 

The purpose of this essay is to solve the equivalent problem in hyperbolic geom- 
etry. More precisely, let A and B be two fixed points on a geodesic line S in the 
hyperbolic plane, and €(h) be a connected component of the set of points which are 
at distance h from S. Such a component is called a hypercycle and the geodesic line 
G is called the axis of €(h). Thus, any point which lies on this hypercycle defines a 
triangle whose base is AB and altitude is h. Therefore, we are interested in finding 


' Friedrich Theodor von Schubert (1758-1825) was a German geographer and geometer, and he was 
a successor of Euler at the Saint Petersburg Academy of Sciences. 
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the point on €(/) which determines the triangle with the maximal or minimal area. 
The hyperbolic analogue of Schubert’s result is: 


Theorem 1. Among the hyperbolic triangles with base AB and third vertex belong- 
ing to the hypercycle €(h), the isosceles triangle is the unique triangle with maximal 
area, and it is the only one with extremal area. 


In order to give a proof, we organized this essay as follows. We shall start by 
recalling some basic notions in hyperbolic geometry such as trigonometric formulae, 
area formulae and the Poincaré upper half-plane model. We shall then prove in an 
elementary way that there is only one critical point for the corresponding area func- 
tion, a critical point which solves our problem. Finally, we shall give another proof 
by using a curve which is the analogue in hyperbolic geometry of the so-called Lexell 
curve. Such a curve in the hyperbolic setting, was defined in [1] and it is also studied 
in the present volume by the second author in a chapter written in collaboration with 
Weixu Su (see [4]). Roughly speaking, the Lexell curve represents the level set of the 
area function of triangles where the base is fixed. 


Figure 1. A geodesic line S with two points A and B and an equidistant curve € (h) at (spherical) 
distance h from §. The points C and C’ such that AC = CB and AC’ = BC’ give the triangles 
ABC and ABC’ with minimal and maximal area, respectively. Furthermore, C and C’ are antipodal 
on the small circle € (A). 
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2 Background in hyperbolic geometry 


This section can be skipped by the reader familiar with hyperbolic geometry. We 
shall only give necessary formulae and backgrounds for the proof of Theorem 1. 
We recall that the area of a hyperbolic triangle A BC is given by the angular deficit 


Area(ABC) =a BAC — ABC — ACB. (1) 


For an elementary proof we refer to [2] in this present volume. Hence, if the triangle 
is right at A we have 


Area(ABC) = = — ABC - ACB. (2) 


We continue with hyperbolic right-angled triangles by giving some trigonometric 
relations. Let ABC be a triangle which is is right at A and who whose lengths AB and AC 
are c and fh, respectively. If we set y = ACB and p= ABC (see Figure 2), then 


tanh (c) __ tanh (/) 


sinh (h) anh an sinh (c) 


tan (y) = (3) 


Figure 2. A right-angled hyperbolic triangle 


It is worth noting that relation (3) can be proven in many different ways. In partic- 
ular, a proof which uses a kinematic approach, can be found in [3] by J. M. de Tilly. 
See also the essay by Slutskiy [9] in the present volume. 

A straightforward application of relations (2) and (3) is the following. 


Lemma 2 (area formula). Let T be a right-angled hyperbolic triangle whose base 
length is c and altitude h. Then 


Area(7’) = 2 arctan (tanh (3) tanh (5)). 
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Although this is elementary, for the convenience of the reader we provide a proof. 


Proof. Let ABC be a right-angled triangle as in Figure 2. From classical relations 
between hyperbolic? and trigonometric functions and from relations (2) and (3) we 
get 


1 
tan (Area(A BC)) = —————— 
(Area( ae G +B) 
_ 1~tan (y) tan (B) 
tan (y) + tan (B) 
1 
-_ 1— cosh(h) cosh(c) 
~~ tanh(h) anh(c) 
sinh(o) a san(h) 
__ sinh (/) sinh (c) (cosh (h) cosh (c) — 1) 
sinh? (h) cosh (c) + sinh? (c) cosh (h) 
_ sinh (/) sinh (c) (cosh (A) cosh (c) — 1) 
~~ cosh? (h) cosh (c) — cosh (c) + cosh? (c) cosh (h) — cosh (A) 
____ sinh (A) sinh (c) 
~ cosh (h) + cosh (c)’ 
(4) 
Analogously,? 
h c 
tan (2 arctan (tanh (5) tanh (5))) 
2 tanh (4) tanh (S) 
1 = tanh? (4) tanh? (S) 
2 (cosh (h) — 1) (cosh (c) — 1) 
—] . cosh(h)—1\ 2 /cosh(c)—1\2 (5) 
sinh (h) sinh (c) (1 > ( Sete ) ( sahie) ) ) 


___ 2 sinh (1) sinh (c) (cosh (1) — 1) (cosh (c) — 1) 

sinh? (h) sinh? (c) — (cosh (h) — 1)? (cosh (c) — 1)? 
sinh (/) sinh (c) 

~ cosh (i) + cosh (¢) 


Comparing (4) and (5), we obtain the lemma. 


2 We essentially meant cosh?(x) — sinh?(x) = 1 and tanh(2x) = (cosh(x) — 1)/ sinh(x). 
3 We first observed that because c and A are strictly positive, 0 < tanh(h/2)tanh(c/2) < 1, and 
therefore 0 < 2 arctan(tanh(h/2) tanh(c/2)) < 2/2. 
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Finally, let us recall a few facts about the Poincaré upper half-plane model of the 
hyperbolic plane, that is, the upper half-plane H = {z € C | Im(z) > 0} equipped 
with the hyperbolic metric dq defined by 


|Z1 — Z2| 


du (21-22) = 2-08" ( i; for all 21,22 € H. 


|Z1 —Z2| 
With respect to this metric, the isometries are either of the form 


az 


zé€Hr> such that (a,b,c,d) € R* and ad —bc = 1, (6) 


or 
az+b 


zé€Hr> such that (a,b,c,d) € R* andad —bc = —-1. (7) 


Furthermore, the geodesic lines are represented by the half-circles with the centers 
situated on the real axis or by the vertical lines. The hypercycles to a given geodesic 
line are the circular arcs or the half-straight lines that share the same endpoints with 
the geodesic line. These two kinds of hypercycles are elements of Figures 3 and 4 
below. 


3 Anelementary solution 


In the rest of this essay, we work in the Poincaré upper half-plane model. Let A and 
B be two distinct points at hyperbolic distance c. Let G be the geodesic line through 
A and B. Up to composition by an isometry (a mapping of the form (6) or (7)) we 
can assume that the coordinate of B is i and the coordinate of A is e°i, therefore the 
line S corresponds to the positive imaginary axis. Let h be a positive real number and 
let €(h) be a hypercycle of axis G which is at distance from S. Up to an isometry of 
the form (7), we can assume that €(/) is the half-straight line {ae!% | w > 0} where 
6, = 2tan~!(e”). For our problem, we reparametrize €(h), by setting for any real 
number f, C; to be the point on €(h) of coordinate e°/2+! . e'%, According to this 
parametrization, the area function A(t) = Area (A BC;) is an even function, since the 
area remains invariant under reflection with respect to the perpendicular bisector of 
AB (such a reflection corresponds to the isometry z +> e°/Z). For any real number f, 
we set D; to be the foot of the perpendicular from C; to J. The coordinate of D; is 
then e°/2+?4;, By applying Lemma 2 to the triangles AD;C; and BD;C;, we deduce 


A(t) = 2tan7! (tanh (« + 7) tanh (3)) —2tan7! (tanh (« = 4) tanh (5))- (8) 
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For more details we refer to Figure 3. From relation (8), or even from the geometric 
definition of area (Formula (1) above), we deduce 


lim A(t)=0O= lim A(¢). (9) 

t——oo t—>-+0o 
We then have a smooth even function which vanishes at infinity. Hence, for solving 
our extremal problem and then proving the theorem stated above (or see Theorem 3 


below), we just have to prove that t = 0 is the unique critical point of the area 
function. 


E(h) 


Figure 3. A geodesic line S with two points A and B and an equidistant curve € (/) at distance h 
from J. Any point C; on €(h) defines two right-angled triangles AD;C; and BD;C;. The perpen- 
dicular bisector of the line segment AB is denoted by €. 


We have 


Ae eae) Pa) 
- 


1 + tanh? (2) tanh? (¢ + £) 1 + tanh? (4) tanh? (t — 
for all t € R, and then by setting 


2 tanh (4) 


8 TF cant? (Ep tanh? (r+ $y (1 + tanh (Ey tanh? (r=) 


(10) 
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we have 


T= (In? (+ §)) (1+? (5) on? (¢—$)) 
= (1 — tanh? (« = 5) + tanh? (5) tanh? (« + 5)) (11) 
= (1 + tanh? (3) (tanh? (1 — =) — tanh? (: + =)). 


Finally from (10) and (11), we deduce that 
AG=0 —} f=, 
and thus we have proved 


Theorem 3. Let Co be the point on E(h) for which the foot of the altitude from Co is 
the midpoint of AB. Then 


Area(ABC) < Area(ABCo) forall C € €(h). 


Although this proof is elementary, it uses some computations, so the natural ques- 
tion is about the existence of a proof which is entirely geometric. The next section 
will show that such a proof exists and uses the notion of hyperbolic Lexell curves. 


4 The use of the Lexell curve in hyperbolic geometry 


Let us recall the definition of a hyperbolic Lexell curve. We fix a hyperbolic seg- 
ment AB and we pick a third point C which is not on the geodesic containing AB. 
The question is to find the locus of points that define a triangle of base AB and with 
the same area as ABC. Such a problem in spherical geometry is called the Lexell 
problem and was solved by Lexell in [5]. In hyperbolic geometry, A’Campo and Pa- 
padopoulos solved in [1] (Theorem 5.11) that problem and the solution is as follows. 
We consider the geodesic line which passes through the midpoints of AC and BC. 
The hypercycle which passes through C and of axis this geodesic line is the solution 
of the Lexell problem in hyperbolic geometry. We also refer to [7] for further investi- 
gations on Lexell curves in the hyperbolic plane and to the essay [6] by Pambuccian 
in this volume, where the author generalizes the notion of Lexell curve in weaker 
geometries, namely, the Bachmann non-elliptic metric planes. 
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Let us turn back to our problem. Let A and B be two fixed points in the hyperbolic 
plane which lie on the geodesic line J, and let h a positive real number. We denote 
by C one of the two points at distance h from the segment AB such that ABC is 
isosceles. We still use the notation €(h) for the hypercycle of axis S which passes 
through C. Up to an isometry, we can assume that we are in the situation of Figure 4, 
that is, the segment AB lies on a geodesic which is a half-circle and whose center is 
the origin and the point C lies on the imaginary axis. We then take a point D which 
belongs to the altitude from C to AB. It defines an isosceles triangle ABD whose 
area is strictly smaller than that of ABC. Furthermore, the corresponding Lexell 
curve intersects €(/) at two points. By moving along the altitude from C to AB and 
dealing with the corresponding Lexell curve, we define a natural surjective mapping 
from the hypercycle €(/) onto the segment that connects C and the midpoint of AB. 
In conclusion, we deduce that the point for which the area is maximal is the point 
C and also that we can have triangles of same base and altitude of area as small as 
possible. We can refer to Figure 4 for the construction of two Lexell curves in the 
Poincaré upper half-plane. 


Figure 4. Two isosceles triangles ABC and ABD and their corresponding Lexell curves. The curve 
E(h) is the hypercycle passing through C of axis the geodesic line through AB. The points D; and 
Dp are two points on €(h) for which Area(A BD 1) = Area(ABD2) = Area(ABD). 
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1 Introduction 


In this chapter, we focus on medians in spherical and hyperbolic triangles. We first 
state Ceva’s theorem in a form that is valid in all three geometries, Euclidean, spheri- 
cal and hyperbolic. As a corollary, we prove that in any of these geometries, the three 
medians in any triangle are concurrent. We then recall a result of Lambert stating that 
given an equilateral triangle ABC with medians AA’, BB’, and CC’ intersecting 
at O, we have 


/ / / I / / I / 
OA’ ==-AA, OA aan OA oo" 
in Euclidean, spherical and hyperbolic geometry respectively [1] (see also [2] for 
further details). Next, we derive a relation for medians in an arbitrary triangle. More 
precisely, let ABC be a triangle whose three medians intersect at a point O, and A’ 
the midpoint of BC. Then, 


pug = 2cos (=) and ee = 2co8h (+) 
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in spherical and hyperbolic geometries respectively. We prove that in spherical ge- 
ometry, when BC > 2m and AO < 4, AO < 2cos BC x OA’, implying 


AA’ 


OA > —————__.. 
- 1 +2cos (2) 


We prove that in hyperbolic geometry AO < 2cosh Be implying 


AA’ 


OA! > ———————___.. 
‘ 1 + 2cosh (2) 


We conclude this chapter by stating a theorem due to Menelaus where he studies a 
geodesic intersecting three sides of a spherical triangle. As a corollary, we prove that 
in a spherical triangle ABC, the midpoints of AB, AC, BC’, and CB’ are collinear 
where B’ and C’ are antipodal points of B and C respectively. 


2 Ceva’s theorem 


We state a theorem due to Ceva on concurrent lines in Euclidean geometry and its 
analogues in spherical and hyperbolic geometries (see [2] for a proof). 


Theorem 2.1 (Ceva’s theorem). Let ABC be a triangle. Let AA’, BB’, and CC’ be 
concurrent lines with A’, B', and C' lying on BC, CA, and AB. Then, in Euclidean 
geometry, we have 

CA’ BC’ AB’ 

x x =1 

AB C'A BIC 

while in spherical and hyperbolic geometries we have respectively 

sin(CA’) - sin(BC’) 7 sin(AB’) _ 

sin(A’B) sin(C’ A) sin(B/C) 


1 


and 


sinh(CA’) — sinh(BC’) _ sinh(AB’) 
—_—_—_ x ——— x ——— = 
sinh(A’B)  sinh(C’A) ~ sinh(B’C) 


We can restate the above theorem in a form that is true in the three geometries. 


Theorem 2.2. Let ABC be a triangle and let AA’, BB’, and CC’ be concurrent 
lines. Then, in all three geometries, Euclidean, spherical and hyperbolic, we have 


sin(CAA’) sin(A BB’) sin(BCC’) 1 
——— =k ——— =k — = 
sin(A’AB)  sin(B’/BC) — sin(C’CA) 
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Proof. 1n the following, we give a proof of this theorem in spherical geometry. The 
proof in the case of Euclidean and hyperbolic geometries is very similar with an 
appropriate modification to the law of sines. Let the concurrent lines meet at the 
point O. Applying the law of sines to the triangle AOB, we find 
sin(OA) _ sin(OB) _, sin(OAB) _ sin(OB) 

sin(OBA) sin(OAB) sin(OBA) sin(OA) 
Similarly, using the law of sines in triangles BOC and COA, we find 

sin(OBC)  sin(OC) ; sin(OCA)  sin(OA) 

= = Spy oe SS = ee 

sin(OCB)  sin(OB) sin(OAC) sin(OC) 
Multiplying the above three equations, we find 
sin(CAA’)  sin(ABB’) _ sin(BCC’) — sin(OB)__ sin(OC) __ sin(OA) 
—S—_ ho oe = oP LT EA 
sin(A’AB) sin(B/BC)  sin(C’CA) | Sin(OA) _—sin(OB) _ sin(OC) 


Corollary 2.3. The medians of a triangle are concurrent in Euclidean, spherical and 
hyperbolic geometries. 


Proof. We prove it for a spherical triangle. The other proofs are similar. Let ABC be 
a spherical triangle. Since AB = BC = CA = zis not possible, we take without 
loss of generality, BC 4 a. Let BB’ and CC’ be medians and let them intersect 
at O. Extend the great circle arc AO to intersect BC at A’. Since B’C = B’A and 
C’'A = C’B, by Ceva’s theorem, we must have sin A’B = sin CA’ or equivalently, 
A'B =CA' since BC # zx. 


3 Medians of spherical and hyperbolic triangles 


We now recall a classical theorem due to Lambert on lengths of medians of equilateral 
triangles. See [2] for a proof and further details. 


Theorem 3.1 (Lambert’s theorem [1]). Let ABC be an equilateral triangle and let 
the medians AA’, BB', and CC’ intersect at O. We have OA’ = 3AM’, OA’ > 
+AA! , and OA’ < + AA’ for Euclidean, spherical and hyperbolic geometries. 


In the following, we deduce a precise relation between AO and OA’. We then 
prove an appropriate inequality between OA’ and AA’ for arbitrary triangles in spher- 
ical and hyperbolic geometries. 
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Theorem 3.2. Let ABC be a triangle with the midpoints of AB, BC, and CA de- 
noted by C’, A’, and B’. Let the medians AA’, BB’, and CC’ intersect at O. Then, 
we have 

sin(AO) _ 


——— = 20s ( 


sin(OA’) =) on? h() 


2 sinh(OA’) p 


in spherical and hyperbolic geometries respectively. 


A 
B' . Cc’ 
A’ 
Figure 1. Spherical triangle ABC with medians AA’, BB’, and CC’ 


Proof. We begin by proving the theorem in spherical geometry. Let ABC be a spher- 
ical triangle and let A’, B’, and C’ be the midpoints of BC, CA, and AB (see Fig- 
ure 1). Let the medians AA’, BB’, and CC’ intersect at O. Using the law of sines in 
the triangles AOB and A’ OB, we have 


sin(A BB’) sin(BAA’) d sin(CBB’) sin(AA’B) 
———__ = ——  _ and ——~ = ——. 
sin(AO) sin(BO) sin(OA’) sin(BO) 


Dividing side by side the two equations, we find 


sin(OA’) sin(ABB’) __ sin(BAA’) 
xX ———_ = — 


———— —— == (1) 
sin(AO)  sin(CBB’)  sin(AA’B) 
In the triangle 4A’ B, we have 
in(AB in(A’/B in(BAA’)  sin(4BC 
sin(AB) _ sin(A’B) a sin( ) _ sin(5 ) (2) 


sin(AA’B) 7 sin(BAA’) sin(AA’B) ~ sin(AB) ° 
Using the law of sines in the triangles BB’C and BB’ A, we have 


sin(CBB’)  sin(BB’C) 4 Sin(4B!) __sin(AB) 
[ee = See a SS Fo 
sin(B’C) sin(BC) sin(ABB’)  sin(BB’A) 
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Multiplying the two equations and using B‘C = AB’ and sin BB'C = sin(BB’A), 
we get 
sin(CBB') _ sin(AB) 


Se (3) 
sin(ABB’)  sin(BC) 
Substituting (2) and (3) in (1), we find 
sin(OA’) _ sin(5BC)  Sin(AB) _ 1 @) 


sin(AO) __ sin(AB) sin(BC) — 2cos (5BC) 


thus proving the theorem in spherical geometry. 
In the hyperbolic case, the law of sines in a hyperbolic triangle ABC gets modi- 
fied to —— ee ee 
sin(BAC) _ sin(CBA) _ sin(ACB) 
sinh(BC) — sinh(CA) _ sinh(AB) 
where the sine function on the edges has been replaced by the hyperbolic sine func- 
tion. Thus, proceeding in a similar manner, equation (4) becomes 
sinh(AO) 
sinh(OA’) 


= 2cosh (58C). 


We now prove two lemmas that will be used in proving Corollary 3.5. 
Lemma 3.3. [f0 <b <a <2, thenasin(b) > bsin(a). 


Proof. Setting k = B we have 
ob i 
sin(b) — — sin(a) = k [ costes) — cos(x))dx. 
a 
0 


Since k < land0O < kx < x < az, we have cos(kx) > cos(x). This shows 
that the right-hand side of the above is positive proving that sin(b) > ® sin(a) or, 
equivalently, a sin(b) > b sin(a). 


Lemma 3.4. Ifa > 0 andk > 1, then k sinh(a) < sinh(ak). 


Proof. We note that 
a 
sinh(ak) — k sinh(a) = k [ (cost —cosh(x)) dx. 
0 


Since k > 1 and x > 0, cosh(kx) > cosh(x) and thus the right-hand side is positive. 
This proves that sinh(ak) > k sinh(a). 
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Theorem 3.5. Let ABC be a triangle with medians AA’, BB’, and CC’ intersecting 
at O. In spherical geometry, when AO is less than a quarter of a circle, we have 


1 2 
2 9c08 (58) x OA’ if BC > -_ 
sa 1 2 1 
> 2cos (=Bc) x OA’ if BC <= and 200s (=BC) x OA <=, 
2 3 2 2 
In hyperbolic geometry, we have 
AO <2cosh (+2c) x OA! 
5 : 
Proof. We start by proving the theorem in spherical geometry. Theorem 3.2 implies 
; 1 . j 
sin(AO) = 2cos (58C) x sin(OA’). 
When 2 cos (4BC) < lor 2cos (5BC) x OA’ < OA’, by Lemma 3.3, 
1 1 
OA’ x sin (2cos (58C) x 04’) > 2cos (58) x OA! x sin(OA’) 


implying sin (2cos (4BC) x OA’) > sin AO. Since 0 < OA’ < 1, when AO < 4, 
this implies 


1 
AO <2cos (58C) x OA’. 
Similarly, when 2 cos (4BC ) > 1, Lemma 3.3 implies 
1 
ani (2cos (58C) x 04’) < sin(AO). 

When AO < on this leads to 

1 jos 1 , 0 

AO > 2co8 (=BC) x OA’ if2cos (=Bc) x OA < =, 
2 2 2 


We conclude the proof in the spherical case by observing that if BC < 2 then 
2cos ($BC) > 2cos (¥) = 1 while if  < BC < x, then 2cos($BC) <1. 


In hyperbolic geometry, 2 cosh ( 4B Cc ) > 1. By Theorem 3.2 and Lemma 3.4, 
1 1 
sinh(AQ) = 2cosh (58C) sinh(OA’) < sinh (2 cosh (58C) x 4'0). 


This implies AO < 2cosh ( BC ) x A’O proving the theorem in hyperbolic geom- 
etry. 
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4 A consequence of a theorem of Menelaus 


We state below a theorem due to Menelaus [3] that will be used to show collinearity 
of midpoints of two adjacent sides of a spherical triangle and two antipodal points on 
the extended third side. 


Theorem 4.1. (Spherics, Proposition 66). Let ABC be a spherical triangle and let 
a geodesic intersect AB at D, BC at E, and CA at F. Then, 


sin(AD)  sin(BE) _ sin(CF) 

———— x ——_ x —— = 

sin(BD)  sin(CE)  sin(AF) 
Corollary 4.2. Let ABC be an arbitrary spherical triangle. Complete the great 
circle BC and denote the antipodal points of B and C by B' and C’. Also, denote the 
midpoint of BC’ by X and its antipodal point by Y. Consider a great circle passing 


through X and Y other than the one passing through B and C. Let it intersect AB 
at E and AC at F (see Figure 2). Then, 


sin(AE) _ sin(AF) 
sin(BE) — sin(CF)’ 


Figure 2. ABC isa spherical triangle. B’, C’, and Y are antipodal to B,C, and X where BX = XC". 


Proof. By Theorem 4.1, 
sin(A FE) - sin(BX ) . sin(CF) _ 
sin(BE) sin(CX)  sin(AF) 
Since BX = XC’, 
m= CC’ =CX+ XC’ =CX + BX => sin(CX) = sin(BX). 
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Hence, 
sin(AE) _ sin(AF) 
sin(BE) — sin(CF)’ 


Corollary 4.3. The great circle through the midpoints of AB and AC passes through 
X and Y. 


Proof. Let E be the midpoint of AB and let the great circle XEY intersect AC at F. 
By the above corollary, 


sin(AF) — sin(AE) _ 
sin(FC)  sin(EB) ” 


implying sin AF = sin FC. However, since 0 < AC < a, wehave AF = FC. 
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1 Introduction 


We consider the geometrical problem of constructing a triangle using a straightedge 
and compass so that its three vertices lie on a given circle and its three sides when pro- 
duced pass through three given points. Pappus of Alexandria formulated and solved 
this problem in the special case where the three given points are aligned. This is 
Proposition 117 of Book VII of the Collection [1]. Lagrange provided an algebraic 
solution, proving that the vertices of the triangle are constructible points. He com- 
municated his solution to Jean de Castillon, who reproduced it in the first section of 
his paper “Sur une nouvelle propriété des sections coniques” (On a new property on 
conic sections) [2]. In the rest of his this paper, Castillon proceeded to show that the 
vertices of the triangle remain constructible when the circle is replaced by an ellipse 
or a hyperbola. Later, the solution due to Lagrange was published in his Oeuvres com- 
plétes under the title “Solution algébrique d’un probléme de géométrie” (Algebraic 
solution of a geometry problem) [3]. Euler gave another solution and commented 
on the case where the given circle and the three given points are on a sphere [4]. 
Interestingly, this article is followed by one due to Nicolas Fuss, a young collabo- 


68 H. Senapati 


rator of Euler, who similar to Pappus, studies the special case where the three given 
points lie on a line [5]. Eugéne-Charles Catalan also studied this problem in his book 
Théorémes et problémes de géométrie élémentaire (Theorems and problems of ele- 
mentary geometry) [6] where he says that such a problem is known as the “Castillon 
problem” (see the footnote at p. 222 of [6]). 

In this chapter, we first recall the algebraic solution due to Lagrange. The idea 
behind such a solution is to prove that the vertices of the triangle are constructible. 
More precisely, it is to prove that the coordinates of the vertices are roots of some 
quadratic polynomials with constructible coefficients. Let us recall the notion of con- 
structibility that is relevant here. Let S be a given set of points in the Euclidean plane 
with cardinality at least 2.1! We say that a circle is constructible from § if its center is 
an element of S and its radius is the distance between two elements of 5. We say that 
a line is constructible from § if it passes through two elements of 5. We say that a 
point in the Euclidean plane is constructible in one step from § if it is an intersection 
point of two constructible lines, one constructible line and a constructible circle, or 
two constructible circles. We then say that a point P of the Euclidean plane is con- 
structible from § if there exists a finite sequence of points P;, P2,..., Py = P such 
that P; is constructible in one step from 5 and for any i (2 <i <n), the point P; 
is constructible in one step from 8 U { P),..., Pj—1}. In particular, let us denote two 
points from S by O and / and set the distance from O to J as 1. We can now intro- 
duce the Cartesian coordinate system in the Euclidean plane where the coordinates 
of O and J are (0, 0) and (1, 0) respectively. We say that a number x is constructible 
from § if it is a coordinate of a constructible point from 8. It is elementary to prove 
that the set of constructible numbers from S forms a field such that the square root of 
any non-negative constructible number from S is still a constructible number from 8. 
In particular, a root of a quadratic polynomial with constructible (from S) coefficients 
is constructible. 

In the next section, we recall Lagrange’s solution. We then study the problem 
where the given circle and the three given points are on a sphere and give a solution 
in the lineage of Lagrange. 


2 Lagrange’s solution 


The problem is to construct a triangle MNP inscribed in a given circle € of radius 
r so that the three edges of the triangle NM, MP, and PN when produced pass 


' We are using a modern way to formulate constructibility. Needless to say, Lagrange did not use 
this language of set theory or of fields. 
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through three given points A, B, and C respectively (see Figure 1). This construction 
should not require more than the given objects and a straightedge and a compass. To 
be more precise, the set we are dealing with consists of six points, the two points O 
and I defined above, the point of coordinates (r, 0), and the three points A, B, and C. 
Let us just add that the center of the given circle can be taken as the origin O. 

We notice that the vertices M, N, and P are constructible if the angles x = 
AOM y= AON and z = AOP are constructible, where an angle is said to be 
constructible if its sine (or equivalently cosine) is constructible. We set the lengths 
of segments 

OA=a, OB=b, OC=c. 


We denote the angles AOB and AOC by m and n. Here, 1, a, b, c, cos(m), cos(n), 
and r are constructible. In the triangle AON, we have 


Figure 1. An inscribed triangle MNP such that VM, MP, and PN when produced pass through 
A, B, and C respectively 


y-x 


ee 1 _—_—_ 1a 
A= ~(n — M)=-—— 
ON 5% NOM) 5) 


and 


OAN =x ~ AON - ONA = = 77. 


Using the sine law in the triangle AON, we have 


OA _ sin(ONA) _,4 cos (>*) 


ON  sin(OAN) r cos (24) 
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o(cs()co() ~sn()n(3) 
= 1(:os(Z)os(5) +5n(3) 0) 


a 
=> (a—r)cos (5 ) COs (3) = (a+r)sin (5) sin (5) 
ceili al ae 
an ( — } tan 
2 2 atr 
In the triangle BOP, we find 
OPRS and Dig ee ee 
2 2 2 2 
Using the sine law in the triangle BOP, we have 
BO  sin(OPB) b cos (Gowen) 
2S eee BS SS 
PO sin(OBP) r cos (ot m)) 
which reduces to 
: x—m ‘ z—-m b-r (2) 
an an = : 
2 2 b+r 
Similarly, in the triangle COP, we have 
— WW zZz-y —— 1 zZ+y 
OPC 5 and OC 5 +n 5 
This leads to 
tan (7) tan (—=*) = — (3) 
2 2 ctr 


The goal is now to determine the unknowns x, y, and z from equations (1)—-(3). For 
simplicity, let us set 


tan (=) = 5S tan (=) =t tan (=) =u tan (=) = tan (=) = 
a ya i a 7 als 


a-r b—-r _ c—T 

aay °° bate °* eae © 
Here, a, 6, y, p, and g are constructible since a, b, c, m,n, and r are constructible. 
Equations (1)-(3) now become 


Ss u— 
sxt=a, Px Pf 


1+ps 1+ pu l+qt I1+qu 
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The first and the second give 


:=* and wa Put Ps) + Pis— P) (4) 


~ (s—p)— Bp. + ps)” 
Substituting the first and the second in the third gives 


gee Pa ee es 8g (5) 
aq+s —-(+B)p+Bq- p?g+U-fp?+(1+B)pagys ~ 


For conciseness, we set 

F=p(—-p+q)+ B+ pq) and G=p—q+ Bp(l+ pq). 
We also set 

H=—(1+)p+Bq-p?q and K=1—fp*+(1+8)pq. 


Therefore, F, G, H, and K are constructible since 6, p, and g are constructible. 
Equation (5) now becomes 


a-—qs F+Gs 


aero 
aq+s H+Ks 


which implies 
(vK — Gq) x s* + (vH —aG +aq(yK — F))xs+a(yHq—F)=0. (6) 


The coefficients of this quadratic equation in s are constructible, hence s is also con- 

structible. This implies that the right-hand sides of the two equations in (4) are con- 

structible and thus implying that ¢ and u are constructible. Thus 
62 _ 2 


s 2 
re OOOO Te 


l-u 
1+ u2 


cos(x) = cos(z) = 
are constructible. For the existence of the vertices M,N, and P, the angles a s, 
and 5 and hence the tangents s, ¢, and u should be real. When s is real, (4) implies 
t,u € R. Hence we have proved that when equation (6) admits real roots, a triangle 
exists and is constructible so that its vertices lie on a given circle and its three edges 
when produced pass through three given points. When equation (6) does not admit 
real roots, such a triangle does not exist. 


Remark 2.1. We now give an example where such a triangle does not exist. Let the 
three given points and the centre of the given circle be collinear such that m =n = 0. 
This implies 


p=tan(>) =0 and q = tan (5) =0. 
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Substituting these values, we find F = 6B, G = 0, H = 0, and K = 1. Thus, 
equation (6) becomes 
yxs*—ap =0 


with discriminant 


a-—r b-r c-r 
x 


atr b+r ct+r. 

This is negative when only one of the three points is in the interior of the circle or when 
all three are in the interior. Thus, by continuity, there exist approximately collinear 
points (m + n & 0, m,n # 0) with either only one of the points in the interior or 
all three in the interior such that the discriminant is negative. In all these cases, the 
triangle to be inscribed does not exist. 


4aBy =4x 


3 Constructibility on the sphere 


We define here a notion of constructibility on the sphere. This requires the use of an 
auxiliary Euclidean plane. Let us assume that the sphere is of radius 1. We report 
on the auxiliary plane two points at distance 1. Let us denote these two points by O 
and J. Now, let 5; be a set of points on the sphere. On the plane, we draw circles 
centered at O of radius the chord distance between two points of the given set. We 
set Sg as the set which consists of the two points O, J and the intersection points of 
those circles with the half-line starting at O and passing through J. We say that a 
circle is constructible from $4 if its pole is a point of S$; and the chord distance to 
the pole is a constructible number from So. Like for the case of constructibility in the 
plane, we say that a point on the sphere is constructible in one step from 8+ if it is an 
intersection point of two constructible circles. Thus, a point P on the sphere is called 
constructible if there exists a finite sequence of points P;, P2,..., Pn = P such that 
P, is constructible in one step from 5, and for 2 <i <n, P; is constructible in one 
step from S54 U {Pi,..., Pi-1}. 

Following our definition of constructibility on the sphere, we state some conse- 
quences which shall be useful in Section 4. 


Lemma 3.1. A great circle on the sphere is constructible if one of its poles is con- 
structible. 


Proof. Let C be the centre of the sphere and P the constructible pole (see Figure 2). 
Let Q be an arbitrary point on the great circle with pole P. This implies PC = 
QC = 1 and PCO = 5 so that PO = /2. Thus the chord distance of a point 
on the great circle from its pole is constructible on the plane implying that the great 
circle is constructible 
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Lemma 3.2. A geodesic through two given points on the sphere is constructible. 


Proof. Let P and Q be the two given points (see Figure 3). We construct the two great 
circles whose poles are P and Q respectively. Let a point of intersection be called 
C’. We have the chord lengths PC’ = QC’ = V2. Thus the great circle whose pole 
is C’ passes through P and Q. Since C’ is constructible, geodesic through P and Q 
is constructible. 


Figure 2 


Figure 3 
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Lemma 3.3. The sine of the angular separation between two constructible points on 
the sphere is constructible in the Euclidean plane. 


Proof. Let D and E be two given points on the sphere (see Figure 4). Construct 
the geodesic DE and extend it. Using a compass, find the point F on the extended 
geodesic so that crd(DE) = crd(EF) where crd denotes the chord length. We have 
crd(DF) = 2sin(DE). Since the chord distance on the sphere is constructible on 
the plane, sin(D£) is constructible in the Euclidean plane. 


Figure 4. Determination of the sine of the angular separation between two constructible points 


Lemma 3.4. Suppose an angle 6 is constructible in the Euclidean plane. Let D and 
E be two points on the sphere. Then, it is possible to construct a great circle arc DG 
such that EDG = 0. 


Proof. Construct the great circle Cp with pole D. Extend the geodesic DE in the 
direction of E and denote the first point of intersection with this great circle by F 
(see Figure 5). Since @ is constructible in the Euclidean plane, 2 sin (8) is also con- 
structible. Construct the circle Cz with pole F and chord distance to the pole set at 
2 sin (8). Let G be a point of intersection of Cp and CF so that crd(FG) = 2sin (2). 
Thus the great circle arc FG subtends an angle @ at the centre of the sphere. There- 
fore, it also subtends the same angle @ at D since D is the pole of the great circle arc 


FG. Hence, joining DG, we have FDG = 8. 


Lemma 3.5. Suppose DE and DG are two great circle arcs on the sphere. The angle 
EDG is constructible in the Euclidean plane 


Proof. The proof is very similar to that of Lemma 3.4. 
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D 


F 


G 


Figure 5. Given DE, constructing DG such that the angle at D is a given angle 


4 Extension of the problem to spherical geometry 


We now consider the extension of this construction problem to spherical geometry. 
Here, the problem remains to construct a triangle MNP using a straightedge and a 
compass whose vertices lie on a given circle C and whose edges when produced pass 
through three given points A, B, and C. However, the given circle and the three given 
points now lie on a sphere instead of the Euclidean plane (see Figure 6). 


Let O be a pole of the circle C. We set, as before, OM = ON = OP =r, 
OA =a, OB = b, OC =c, AOB =m, and AOC = n. We denote the unknown 
angles AOM, AON and AOP by x, y, and z respectively. 


Figure 6. The circle @ is on the surface of a sphere 
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By Lemma 4.2 (to be given below), the geodesic AM N intersecting the circle C 
at M and N gives the relation 


tan (3) tan (3) = SAO — ay, 


This is reminiscent of equation (1). Similarly, the geodesics BMP and CNP give 
identities reminiscent of equations (2) and (3): 


; (S")e 55") sin(b — r) b 
an an = —__ = 
2 2 sin(b +r) “ 
y-n zZ—n sin(c — r) 
(a eae 
2 2 sin(c + r) 
As before, we set 


tan(=) =s tan(>) =r tan (=) =u tan (=) = tan (=) = 
oY ee CY 7) ae 7? oy a 


Since the angles m and n are constructible on the sphere, by Lemma 3.5, they are con- 


structible on the plane. Hence, p = tan (2) and g = tan (3) are also constructible 


on the plane. We also set F, G, H, and K: 


and 


F = p(-p+q)+B+(+ pq), 

G=p—q+yp(l+ pq), 

H =—(1+ By)p + Bya— pa, 

K =1~—Byp*+ (1+ By)pa. 
With this notation, we obtain the equations 


pa Mt _ Pe + Ps) + P= p) a4+—4qs) F+Gs 


= ) ————- = : 7 
5 i Wafpdits eaig fae 


It is clear that s, t, and wu are constructible in the Euclidean plane if a+, B+, and y+ 
are constructible on the plane. Extending the great circle arcs OA, OB, and OC, 
and denoting the point of intersections with the circle by A+, Bi, and Cy as shown 
in Figure 6, we find AAtj = a+r, BB = b+r, and CCy = c +r that 
are constructible on the sphere. Hence, by Lemma 3.3, sin(a + r), sin(b +1), and 
sin(c + r) are constructible on the plane that implies a+, 8+, and y+ and hence s, f, 
and u are constructible in the Euclidean plane. As shown before in the planar case, 
when s € R, the angles x, y, and z are real and are constructible in the Euclidean 
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plane. By Lemma 3.4, we can construct geodesics OM, ON, and OP so that they 
make angles x, y, and z with OA so that the vertices M, N, and P are constructible. 
When s ¢ R, the required triangle does not exist. We have thus generalized the 
algebraic solution due to Lagrange to the spherical setting. 


Remark 4.1. We give an example where the triangle to be inscribed does not exist. 
Consider the limiting case where the three given points A, B, and C approach O that 
is the pole of the given circle. In this limit, OA, OB, and OC vanish so that w+, B+, 
and y+ approach —1. Similarly, when A, B, and C approach the antipodal point of 
O,a+, B+, and y+ approach —1. In both limits, 


—FxKx1+p* and GxrH ~-q(1+ p’). 
The quadratic equation in s (third equation of (7)) now becomes 
(- + a ~ 1 
q-Ss 
that has no real roots. Thus, there exist A, B, and C near O (or near the antipodal 
point of O) so that the inscribed triangle does not exist. 


It remains to prove the following lemma. 


Lemma 4.2. Let © be a small circle on a sphere with pole O. Let A be a point on 

the sphere. Let a geodesic through A intersect C at points M and N. Then, we have 
| — | — in(OA — OM 

tan (= 40M) tan (= 40N) = ats 

2 2 sin(OA + OM) 


Proof. For conciseness, let us set OM = r, OA = a, AOM = x, and AON = y 
(see Figure 7). By Lemma 2.3 in [7], in the triangle AOM, we have 
2sin(OM) 


tan(OAM) = ——— 
tan (4AOM) sin(OA + OM) + cot ( 


140M) sin(OA— OM) 
while in the triangle AON, we have 
2sin(ON) 


tan(OAN) = —— 
tan (4 AON) sin(OA + ON) + cot ( 


140N) sin(OA — ON) 
Since OAN = OAM and OM = ON, this implies 
sin(a + r) tan (5) + sin(a — r) cot (5) 


= sin(a + r) tan (5) + sin(a —r) cot (5). 
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Solving for tan (3) in terms of tan (3) in this quadratic equation, we find 


wn($)=1n(2) or tan(5)tan(2) = 8e=9. 


When tan (3) = tan (3), x and y differ by an integer multiple of 27 implying that 
M and N coincide. Here, AM is tangent to the circle, hence AMO = of Thus, we 
have cos(x) = cot(a) tan(r). This implies 


x y 2 (Xx 1 —cosx sin(a — Tr) 
a Ce ere ae 
2 2 2 1+cosx  sin(a+r) 


N. 


we 


Figure 7 
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1 Introduction 


The Spherics by Menelaus of Alexandria (1‘'-2"" century CE) is a treatise on spher- 
ical geometry in the spirit of Euclid’s Elements. The propositions follow a logical 
order, starting with the fundamental properties of a spherical triangle and then ex- 
ploring further and more involved properties. A critical edition of this treatise with 
the first English translation accompanied by mathematical and historical commen- 
taries has recently been published [1]. We start by recalling Proposition 27 in this 
treatise. This is a comparison theorem between the base of a spherical triangle and 
the great circle arc joining the midpoints of the two legs. The theorem expresses a 
property of the sphere which later became known as Busemann’s criterion for positive 
curvature in metric spaces [2]. 


Theorem 1.1 (Spherics, Proposition 27). In any spherical triangle ABC, let D 
and E be the midpoints of AB and BC, and let DE be the arc joining them. Then 
DE > $AC. 
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We now state Propositions 28 and 29 of the same treatise. These are comparison 
theorems of a similar sort, now comparing angles instead of edges, when a spherical 
triangle is divided in the same way by a great circle arc joining midpoints of two 
sides. 


Theorem 1.2 (Spherics, Proposition 28). Let ABC be a spherical triangle such that 
the angle B is at least equal to a right angle, let D and E be the midpoints of AB 
and BC respectively and let us join DE. Then, BDE < BAC and BED < BCA. 


Theorem 1.3 (Spherics, Proposition 29). Let ABC be a triangle such that the angle 
A is at least equal to a right angle, let E be the midpoint of BC, D the midpoint of 
AB and G the midpoint of AC. We join ED, DG and GE. Then, BDE < BAC 
and CGE < CAB. 


In what follows, we first prove a monotonicity result in the lineage of Proposi- 
tions 28 and 29. The statement is that in a spherical triangle ABC with BC > AC, if 
the sides CA and CB are increased while holding the angle C and the ratio CA/CB 
constant, the angle CAB increases monotonically. As a corollary, we prove that the 
angle subtended at the base of a spherical triangle is greater than its homologue sub- 
tended at the geodesic joining the midpoints of the legs if the adjacent leg is shorter 
than the other leg. Here, the assumption on angles in Propositions 28 and 29 has been 
replaced with the assumption on sides. We then extend this comparison theorem to 
hyperbolic geometry proving that in a hyperbolic triangle ABC with BC > AC, if 
the sides CA and CB are increased while holding the angle C and the ratio CA/CB 
constant, then the angle CAB decreases monotonically. 


2 Monotonicity in spherical triangles 


A spherical triangle is a triangle formed on the surface of a sphere by three great circle 
arcs, each less than a semi circle, intersecting pairwise at three points called vertices. 
The length of a side is given by the angle subtended at the centre of the sphere by the 
two rays connecting the centre to the endpoints of the side. The angle at a vertex is 
the dihedral angle between the planes containing the two sides. We begin by proving 
that the angles subtended at the vertices are all less than zr. 


Lemma 2.1. In a spherical triangle, the angles subtended at the vertices are less 
than 1. 
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Proof. Let ABC be a spherical triangle so that AX B is less than a semicircle (Fig- 
ure 1). Complete the great circle AXBY where we have BYA greater than a semi- 
circle. We prove that the angle subtended by the great circle arc AXB at C is less 
than the one by BYA at C. Since the two angles add up to 27, this shows that the 
angle C subtended by AXB is less than z. The antipodal points of A and B (called 
A’ and B’ respectively) lie on the arc BYA since AA’ and BB’ are semicircles while 
the arc AXB is less than a semi circle. The great circle arcs AC and BC, when 
produced, pass through A’ and B’ respectively. Thus, the minor great circle arc B’ A’ 
(contained in B YA) subtends the same angle at C as the arc AX B implying that the 
angle subtended by BYA at C is greater than the one by AXB. 


X 


Figure 1. In a spherical triangle ABC, AB subtends an angle less than z at C 


We now prove an inequality involving the sine function that will be used in prov- 
ing Theorem 2.4. 


Lemma 2.2. If0 <a <2 and0 <k <1, thenk sin(a) < sin(ak). 
Proof. Proving this lemma is equivalent to showing that 


sin(ak) 
k 


—sin(a)>0O or [ costes) —cos(x)) dx > 0. 
0 


We note that0 < kx < x < a since0 <a < x and0 <k < 1. Since cosine 
is monotonically decreasing in the interval (0, 7), this implies cos(kx) > cos(x), 
hence proving the lemma. 
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Below, we deduce a formula for the value of an angle at the base of a spherical 
triangle when the length of its two legs and the angle in between are given. 


Lemma 2.3. Let ABC be a spherical triangle where the sides AB, BC, and CA 
are denoted by c, a, and b respectively. Then, for A # 5, 
2sina 
tan(A) = — 
(4) tan ($C) sin(a + b) —cot (4C) sin(a — b) 
while for A = 5, 
1 
sin(a + b) x tan? (5c) = sin(a — b). 


Proof. We first recall that when A+ B #¢ a anda+b ¥ x, Napier’s analogies give 


cot (5C) cos (4(a — b)) 


1 
tan (54 + B)) = aos (Ata 4 b)) 


and 
cot($C) sin (4(a — b)) 
sin (4(a + b)) , 


Here, notice that 0 < tan (4c ) < oo since 0 < $C < 2. We now proceed to the 


2 2 
proof. 


tan (544 - B)) = 


Case 1: A # 5. We first consider A + B # 7. By the law of sines,a +b = x 
would imply A = B leading to A = 4. Hence, a + b # x. We now rewrite tan(A) 
as tan (3(A + B)+ 3(A — B)) to get the relation 


tan ($(A + B)) + tan (5(A — B)) 


tan(A) = 
1 — tan (3(A + B)) tan (3(A - B)) 
cot (ZC) x FAS 
— 1 in(a—b) 
1 cot4(5C) ee 
2 sin(a) 


= tan (4C) sin(a + b) —cot (4C) sin(a — b) 


When A+ B = 7, the law of sines implies a + b = mz or a = b. However the latter 
is not possible since it implies A = B leading to A = 5. Substituting A+ B = 2 
and a + b = z in the second Napier’s analogy, we find 


tan (4 = =) = cot (2€) x sin@a — 2a) 
2 sin(a) 
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implying 
2 sin(a) 


tan(A) = cot (EC) x sin(b—a) 


Since sin(a + b) = 0, this satisfies the relation given in the lemma. 


Case 2: A= 4. When B # §, then 1 —tan (5(A + B)) x tan (4(A- B)) vanishes 
implying 


1 —cot ( c)— (4(a + d)) . sin (4(a + d)) 


2 
: 2(1 : 
<=> sin(a + b) x tan (5c) = sin(a — b). 
When B = as by the law of cosines, 
0 = cos(A) = —cos(B) cos(C) + sin(B) sin(C) cos(a) = sin(C) cos(a). 
Hence, cosa = 0 implying a = 4. Since A = B, b =a = 4. Thus, sin(a — b) = 


sin(a + b) = 0 so that the implied relation is satisfied. Hence, we have proved the 
lemma. 


We now state and prove a monotonicity result in spherical geometry in the lineage 
of Propositions 28 and 29. The idea is to show that the derivative of the angle involved 
with respect to a side is positive and hence the angle increases when the side increases. 


Theorem 2.4. Let ABC be a spherical triangle with BC > AC. Let D and E be 
points on AC and BC so that CD/CA = CE/CB. If CD < CA, thenCDE < 
CAB. 


Proof. We refer to Figure 2. In a spherical triangle ABC, we denote the sides AB, 
BC, and CA by c, a, and b. Notice that the triangle is completely determined by C, 
a, and b, or equivalently, by C, B and a. Here, 0 < a < a, and by Lemma 2.1, 
0<C <a. We set B = k where 0 < k < 1 since b < a. In the following, we first 
prove that A is a continuous function on the interval 0 < a < a. We then show that 
for A # 5, ga > 0 while holding the angle C and the ratio k fixed. Next, we prove 
that A takes the value > for at most one value of a for fixed C and k. This proves 
the theorem. 
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iC 


A B 


Figure 2. ABC isa spherical triangle, BC > AC and CD/CA = CE/CB 


We set 
D = tan (5) sin(a + ak) — cot (5¢) sin(a — ak). 


By Lemma 2.3, A = + when D = Oand since D is the restriction of a holomorphic 
function, it has a finite number of zeroes. When D ¥ 0, tan(A) = 2sin(a)/D and 
A is continuous in a. Thus, to show that A is continuous on the interval 0 < a < 2, 
we need to show that A is continuous at all points p in the interval where D(p) = 0. 
At such a point p, A = 4. Since 2sina # 0 for0 <a < , limg+» D(a) > 0 
implies tan A — too or A > 4. Hence, A is continuous on the interval 0 <a <z. 


We note that 


a aeaeaen for) <A; 


m + arctan(tan(A)) for F< A<z. 


This implies that for A # F, 


0A 1 : 
a fee ae 


Therefore, for A # 5, 


A 
* (tant) >0=> ol > 0. 
da 
We now show that 2 tan(A) > 0. Since tan(A) = 2 sin(a)/D, this holds true if 


p£e sin(a)) — santa > 0. 
da da 
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The left-hand side of the inequality is equal to D; + D2 where 


D, = 2tan (5c c) (sin(a + ak) cos(a) — (1 + k) cos(a + ak) sin(a)) 


= 2tan € C)(sin(ak) —k sin(a) cos(a + ak)) 
and 


D2 = 2cot (5¢ cya — k)cos(a — ak) sin(a) — cos(a) sin(a — ak)) 


= 2cot € C)(sin(ak) — k sin(a) cos(a — ak)). 


Since 0 < C < , tan ($C) and cot ($C) are both strictly positive. As0 <a <2 
and 0 < k < 1, by Lemma 2.2, we have sin(ak) > k sina and thus 


sin(ak) > k sin(a)cos(a + ak) and sin(ak) > k sin(a) cos(a — ak). 


This implies 2 — ae — + Dz > Osince D; > 0and D2 > 0. Thus, we have 
proved that for A# 5,5, > 0. 

Since A can take the ee 5 at finitely many points, the inequality aa > 0 when 
A # 4 implies that for fixed C andk, A = 4 for at most one value of a. We have 


thus proved the theorem. 


The assumption that the adjacent leg is shorter than the farther one relaxes the 
condition that one angle at the vertices be greater than 4 in Propositions 28 and 29 of 
Menelaus (Theorems 1.2 and 1.3). In particular, in the following corollary, C > + 
has been replaced by CB > CA. 


Corollary 2.5. Let ABC be a spherical triangle and let D and E be be the midpoints 
of CA and CB respectively. Let us join DE. If CB > CA, then CDE < CAB. 


Remark 2.6. In Propositions 28 and 29 of the Spherics, an angle at the vertex was 
assumed greater than 5. In Theorem 2.4, while proving monotonicity of a base angle, 
we made the assumption that the adjacent leg is shorter than the distant one. More 
precisely, CB > CA was needed to show monotonicity of the angle A. We now show 
that these conditions are crucial. 

In Figure 3, CAC’B is a lune. We first fix a constant € < 1. We then find points 
A and B on the boundary of the lune and arbitrarily close to C’ with the condition 
CB = (1—€)CA. Hence, CAB = C’! = C and CBA x nx. Let E be the midpoint 
of CA and F that of CB so that CEF ~ 5. If the angle C were to be less than a 
right angle, it would imply 


CAB C < 


vs 
. 
es 
zy 
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in violation of Propositions 28 and 29. This necessitates the condition that the corre- 
sponding angle at the vertex in a spherical triangle be greater than 7. 

We now find two more points X and Y arbitrarily close to C on great circle arcs 
CAC’ and CBC’ satisfying CY = (1 — «)CX (note that CX/CA = CY/CB). 


Here, CXY = CYX x 4(x —C). Thus, when the angle C is less than of we 


have CXY < CEF but CEF > CAB. Hence, no monotonicity result can be 
established in this setting. We have thus seen that the required conditions on angles 
in Propositions 28 and 29 of the Spherics and that on sides in Theorem 2.4 are crucial. 


Figure 3. C and C’ are antipodal points. E and F are midpoints of CA and CB and CX/CY = 
CA/CB. 


3 Monotonicity in hyperbolic triangles 


In this section, we obtain a theorem on monotonicity in hyperbolic triangles that is 
analogous to Theorem 2.4. Before proceeding, we first prove a relation among angles 
and sides in a hyperbolic triangle that will be used in proving the theorem. 
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Lemma 3.1. Let ABC be a hyperbolic triangle. Let the sides AB, BC, and CA be 
denoted by c, a, and b. Then for A # oe 


sin(C) 
sinh(b) coth(a) — cosh(b) cos(C ) 


while for A = 5, sinh(b) coth(a) = cosh(b) cos(C). 


tan(A) = 


Proof. By the law of cosines, we have 
cosh(a) = cosh(b) cosh(c) — sinh(d) sinh(c) cos(A). 
In this equation, we substitute 


cosh(c) = cosh(a) cosh(b) — sinh(a) sinh(b) cos(C) 


and 
sinh(a) 


sinh(c) = an) 


x sin(C) 


to get 
sinh(a) sinh(b) cot(A) sin(C ) 
= cosh(a) sinh?(b) — sinh(a) sinh(b) cosh(b) cos(C ). 
Dividing both sides by sinh(a) sinh(b), we obtain 
cot(A) sin(C) = coth(a) sinh(b) — cosh(b) cos(C). 
This implies that for A = 4, coth(a) sinh(b) = cosh(b) cos(C) while for A # 4, 


sin(C) 
sinh(b) coth(a) — cosh(b) cos(C)” 


tan(A) = 


We now state and prove the theorem on monotonicity in hyperbolic triangles. 


Theorem 3.2. Let ABC be a hyperbolic triangle with BC > AC. Let D and E be 
points on AC and BC so that CD/CA = CE/CB. Furthermore, let CD < CA. 
Then, CDE > CAB. 


Proof. In Figure 4, we set BC = a and CA = Db. Proceeding similarly to the 
spherical case, it follows from Lemma 3.1 that A is continuous in a. We first show 
that for A # 5, a < 0 while holding the angle C and the ratio B fixed. We then 
show that there is at most one value of a for fixed C and b for which A = on This, 
along with continuity of A in a proves the theorem. 
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By Lemma 3.1, for A # 4, we have 


sin(C) 


tan) ~ Sah(B) coth(a) — cosh(b) cotC) 


Using quotient rule and setting B = k =constant, 


k cosh(ak) coth(a) — (k cos(C) + csch?(a)) sinh(ak) si 


0 
Tiare ~ (sinh(b) coth(a) — cosh(b) cos(C))? i 


Since 0 < C <x, sinC > 0. Thus, # tan(A) < Oif 


k cosh(ak) coth(a) — (k cos(C) + esch?(a)) sinh(ak) > 0. 


Cc 


Figure 4 


As the left-hand side is linear in cos(C), we need to show that 


As = kcosh(ak) coth(a) — (+k + esch?(a)) sinh(ak) > 0. 


However, A+ > Oimplies A- > A+ > 0 since sinh(ak) > 0. To prove A+ > Ois 
equivalent to showing A = (sinh?(a)) A, > 0. We find 
JA 
van (2 — k)k sinh(a) sinh(a — ak) 
a 


so that the derivative never vanishes when 0 < k < 1 and 0 <a < oo. This implies 
that A does not have a local extremum in this region. At k = 0 or 1, A = 0 while 


limA=0O and lm A=o. 
a—>0 a—>oo 


This implies that A does not change sign in the region 0 < k < 1,0 <a < ow and 
hence A > 0. We have now shown that ga < Ofor A # F. 
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We now show that for fixed C and B. there is at most one value of a for which 
A= 5 and that A is a continuous function of a. We note that, by Lemma 3.1, A = 5 
when 


sinh(b) coth(a) — cosh(b) cos(C) = 0 <> tanh(a) x tanh(ak) = cos(C). 


Since both tanha and tanhak are positive and monotonically increasing functions 
fora > O and 0 < k < 1, their product is also an increasing function. Therefore, 
tanh(a) x tanh(ak) = cos(C) has at most one solution for fixed values of k and C. 
We have thus shown that ga < Ofor A # S and that A = S for at most one value 
of a for fixed C and B. Hence, we have proved the theorem. 
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1 Introduction 


The fifth Euclidean postulate has been a subject of controversy between mathemati- 
cians regarding its necessity and independence from the other axioms since Euclid 
stated it over two thousand years and until the discovery of non-Euclidean geometry 
in the 19" century. Many famous geometers and philosophers tried to show that this 
axiom is a consequence of the other axioms, without success. In 1733 Girolamo Sac- 
cheri [4],, the Italian priest and mathematician who taught theology and mathematics 
at the University of Pavia, considered the first four Euclidean postulates together with 
the negation of the fifth, and tried to show the inconsistency of this system of pos- 
tulates, and thus to prove the necessity of the fifth postulate, that is, to prove that it 
logically follows from the other axioms. Saccheri thought he succeeded to realize 
his idea but he made a mistake in his proof. In 1766, Johann Heinrich Lambert pub- 
lished a similar investigation, but he did not find any contradiction in the new system 
of postulates and did not emit any definite conclusion; see [2] and [3] for the details. 
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The fifth postulate is equivalent to the fact that the sum of the angles in a triangle is 
equal to z. Lambert knew that the angle sum in a spherical triangle is greater than zr, 
and that the area of a spherical triangle is, up to a constant factor, equal to the angle 
sum minus zr. In his 1766 memoir, he made an extensive study of quadrilaterals with 
three right angles in the three geometries (the Euclidean, the spherical and the hyper- 
bolic), which are called today Lambert quadrilaterals, or Ibn al-Haytham—Lambert 
quadrilaterals. He showed that the angle sum in a triangle in the (hypothetical) 
hyperbolic plane must be less than zr, and that the area of such a triangle has to 
be proportional to the angle defect, that is, 7 minus the angle sum, a situation anal- 
ogous to the one in spherical geometry but where inequalities have to be reversed: 
angle sum is greater than wz and area is proportional to angle excess, that is, angle 
sum minus 7. 

Joseph-Marie de Tilly (1837-1906), a Belgian artillery colonel and professor at 
the Royal Military School in Brussels worked extensively on non-Euclidean geom- 
etry. After Janos Bolyai, Carl Friedrich Gauss, and Nikolai Ivanovich Lobachevsky 
and independently of them, he rediscovered hyperbolic geometry, in the 1860s. Like 
his famous predecessors, de Tilly found a geometry where the Euclidean fifth postu- 
late is replaced by its negation, but unlike them, he used reasonings from mechanics 
in his investigation of this new geometry which incidentally he called “abstract ge- 
ometry”’! in order to distinguish it from from the “real” Euclidean geometry and from 
Lobachevsky’s “imaginary geometry,” although the Lobachevsky geometry and his 
geometry amount to the same. 

The key idea of de Tilly was to use a velocity-addition formula which was known 
in kinematics, assuming that at every point of the two-dimensional space under con- 
sideration, the tangent space is the vector space R? with the standard Euclidean 
norm.? In his 1870 paper titled “Etudes de mécanique abstraite” [1], he applies his 
kinematical approach to the study of non-Euclidean geometry. In the first and second 
parts of his paper, de Tilly applies his method to find several formulae relating the 
length of a curve at distance r from a geodesic (a function he calls eq(r)) and of the 
length of a circle of radius r (a function he calls circ(r)) on his “abstract” 2-plane. 
Combining the obtained relations, he gives an original proof of formulae (3) below in 
hyperbolic geometry, and he also deduces the Laws of Sines and Cosines in spherical 
space. It is important to note that by the time he was working on this subject, the 
Poincaré model of the hyperbolic plane was not yet known, and de Tilly could not 
work in models. He used other methods we shall explain below. The third part of his 
paper is mostly dedicated to the basics of mechanics in the new “abstract geometry.” 
For instance, de Tilly studies compositions of translations and rotations and estab- 


' “Géométrie abstraite” in French. 
2 We are using here modern mathematical terminology. 
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lishes the formulae for the transition from one inertial reference frame to another. 
A quick overview of de Tilly’s paper is given in [5, pp. 85-110]. 

It is may be useful to recall that the terminology and notions universally known 
and accepted by mathematicians has changed since the 19"" century. Therefore not 
all proofs in de Tilly’s paper [1] are clear today, although there is no doubt that this 
paper was totally understandable by the mathematicians 150 years ago. In Section 2, 
we give formulae for the functions eq(r) and circ(r) in the Euclidean, hyperbolic, 
and spherical cases. Alternatively, it is not difficult to obtain them by direct calcu- 
lations with the use of modern techniques. In Section 3, we give several examples 
of de Tilly’s reasoning to explain the main principles. Quite standard manipulations 
combined with the results of Sections 2 and 3 imply a unified proof of the Laws of 
Sines and Cosines in both hyperbolic and spherical spaces. This is the content of 
Section 4. 


2 Functions circ and eq: modern calculation 


Following de Tilly’s paper [1], let us introduce, in the Euclidean, hyperbolic, or spher- 
ical plane, the functions 


* circ(r) equal to the length of a circle of radius r > 0, defined whenever the 
corresponding circle exists; 


* eq(r) equal to the length of a curve at distance r > 0 from a geodesic segment 
of length 1, again defined whenever the corresponding equidistant curve exists. 


In what follows, when we talk about a “space,” it is understood that it is 2- 
dimensional. 

For the Euclidean space, let us denote the corresponding functions by circr(r) 
and eqp(r), for the spherical space by circg(r) and eqg(r), and for the hyperbolic 
space by circyy(r) and eqy;(r). 

It is elementary that in Euclidean space we have the formulae 


circrn(r) = 22r and eqp(r) = 1 (1) 


for all r > 0. 
A direct calculation on a sphere of curvature K = M*, M > 0, gives 


circs(r) = = sin(Mr) and eqg(r) = cos(Mr) (2) 


defined for 0 < r < ra and0Q<r< ie respectively. 
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A similar calculation in the Poincaré model of the hyperbolic plane of curvature 
K =—M2,M > 0, shows that 


circa (7) = = sinh(Mr) and eqy(r) = cosh(Mr) (3) 


defined for all r > 0. 
Thanks to (1)-(3) we can naturally extend the functions eq and circ to the whole 

real line R. Unifying (1)-(3) we conclude that for a space of constant curvature 

K € R the following formulae hold: 

20 


VE 


r € R (for K <0 we consider the branch /K=i /|K)). 
By formulae (4), in a space of arbitrary constant curvature K, K € R, the “ex- 
tended” functions eq and circ satisfy the following properties: 


sin(V Kr) and eqx (r) = cos(VKr), (4) 


ciren (r) = 


circ(r)= —circ(—r) and eq(r)=eq(—r), forallr € R. (5) 


In his work, de Tilly supposed that, in contrast to the Euclidean case, the space he 
considered verifies either eq?(r) > 1 or 0 < eq?(r) < 1. These inequalities easily 
follow from (2) and (3). Like de Tilly, in what follows, we shall deal exclusively with 
his hypotheses on circ(7) and eq(r) and we shall not use the explicit expressions (2) 
and (3). 


3 The functions circ and eq: ancient approach 
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In this section, we give several of de Tilly’s “mechanical” ideas applied to geome- 
try. This will allow us to obtain formulae relating the functions eq(r) and circ(r) 
commonly in the Euclidean, hyperbolic, and spherical spaces without the use of any 
model for these spaces. 

Let us recall two main kinematic principles used by de Tilly. 


Principle 1. If a point P moves along a geodesic / with a constant speed v then the 
absolute value v’ of the velocity of a point P’ moving along an equidistant 
curve /’ at distance r from / such that the geodesic line passing through P and 
P’ remains orthogonal to / at any time satisfies the formula 


v' = veq(r). 


Note that by the definition of the function eq(r), if P passes any distance d 
along /, the point P’ should pass the distance d eq(r) along /’. The velocity of 
P’ is tangent to /’ and points in the direction of the movement of P’. 
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Principle 2. Let us consider the rotation of a point P at a constant speed wp around 
a circle of radius | centered at a point O. Define the angular velocity w around 
O as follows: the absolute value of w is equal to wp; w is positive if P rotates 
in a clockwise direction, and is negative otherwise (in contrast to the standard 
definition of the angular velocity). Let us now consider the rotation of a point 
P’ at distance r around O with the angular velocity w. By definition, a full turn 
around O is performed at the time cre) As the length of a circle of radius 
r is circ(r), we conclude that the speed of P’ is oo. The velocity vector 

of P’ is tangent to the circle of rotation of P’ and it points in the direction of 

rotation of P’. 


Note that both principles deal only with the first four Euclidean axioms and do 
not use the parallel postulate. 


3.1 Moving along equidistant lines 


Let /o and /4z be two perpendicular straight lines intersecting each other at O, and 
let A and B be two points on /4z such that A lies between O and B. Denote by a 
the length of the segment OA, and by b the length of the segment AB. 

Let us move the point O together with the line /4z along the geodesic /g in such 
a way that /4g remains orthogonal to /9 during the translation. Thus the points A 
and B move along the curves c4 and cg equidistant from /9 at distances a anda-+ b, 
respectively. The velocity vector t of O at any time fo is tangent to /g. Denote by v 
the length of v. Since we consider a translation along /g, the angular velocity around 
O is zero. Also, the velocity vectors U4 and Ug of the points A and B at the time 
to are tangent to the curves c4 and cg, respectively. Denote by vg the length of v4, 
by vg the length of tg, by wa the angular velocity around A that (together with v4) 
provides the moving of A along c4 as O moves along /g, by wg the angular velocity 
around B that (together with Ug) provides the moving of B along cg as O moves 
along /o (see Figure 1). 

Let us consider the mutual movement of the points O and A. By Principle 1, the 
speed of the point A induced by the translation of the point O is 


va = veq(a). (6) 


On the other hand, the movement of O induced by the movement of the point A can 
be decomposed into the rotational and the translational parts. If the point A only 
translates along a geodesic line with a tangent vector v, at A, then, as previously, in 
order to keep the line /4g orthogonal to such a geodesic, the point O should move 
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Figure 1. Moving along equidistant lines 


orthogonally to 14g with speed v4 eq(a) in the same direction as the point A does. 
We will call it a translational component of the motion of O. When A is fixed and /4z 
rotates around A with an angular velocity w, then, by Principle 2, O moves along 
oni tangent to 
the considered circle, i.e orthogonal to /4z. We will call it a rotational component 
of the motion of O. Since an angular velocity is positive if the rotation is clockwise 


a circle centered at A of radius a with velocity vector of length w, 


and since the point O is located “under” A on Figure 1 (we can always turn the plane 
containing the lines /4g and /9 so that the point O becomes “under” A), the rotational 
component of the motion of O has an opposite direction to the vector v4. Thus, 
the rotational and the translational components of the motion of O have opposite 
directions. Therefore, according to the kinematic velocity addition rule, we can write 


v= nea) ” 
Similarly, the movement of B viewed from the point O is described by 
vp = veq(a + 5), (8) 
the movement of O viewed from the point B can be written as 
v= ‘edge (9) 


circ(1) 


De Tilly’s mechanical view on hyperbolic and spherical geometries 99 


the movement of B viewed from the point A is described by 


circ(b 
va = v40q(b) + og, (10) 
circ(1) 
and the movement of A viewed from the point B can be written as 
circ(b 
v4 = vp eq(b) — op. a) 
circ(1) 
Substituting (6) into (7), we get 
WA 
—- -1 12 
circ(1) on a=), ue 
Substituting (8) into (9), we get 
a) 
= (eq?(a + b) — 1). (13) 


cire(I) TCE 
Substituting (6), (8), and (13) into (11), we obtain 
circ(b) 
circ(a + b) 


Although it is not used in what follows, notice that if we set p=a+b, w=b, p—W=a, 
then the previous formula becomes 


eq(a) = eq(h) eq(a + b) — (eq’(a + b) — 1). 


cay — W) = ea(Weate) ~ (ea?(p) ~ 1), (14) 
Next, substituting (6), (8), and (12) into (10), we obtain 
eq(a +b) = ea(a)ea(b) + TA (eq2(a) 1). (15) 
By symmetry, if we change the roles of a and b in equation (15), 
ea(a +b) = ea(a)e4(b) + 19 (eq?(b) ~ 1). (16) 
Expressing S44 from (15) and (16), we obtain 
(eq(a + b) — eq(a) eq(b))? = (eq2(a) — 1)(ea2)- 1). 17) 


Remark that for every r > 0 such that there exists a circle of radius r in the 
given space, the function circ(r) is positive, by definition. Looking at (15), we notice 
that expression eq(a + b) — eq(a) eq(b) is not positive in the spherical case (since 
eq?(a) < 1 in this case, see (2)), and it is not negative in the hyperbolic case (since 
eq? (a) => 1 in this case, see (3))). Therefore, the identity (17) implies 
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(s) in the spherical case 
eq(a + b) = eq(a)eq(b) — Y(eq?(a) — Hewr(b)- 1), 8) 
(h) in the hyperbolic case 


eq(a + b) = eq(a) eq(b) + v (eq?(a) — 1)(eq*() — 1) (19) 


for all a and b where the functions circ and eq are defined. Using (2) and (3), we 
conclude that formulae (18) and (19) hold for all a, b € R. 


Corollary 3.1. Taking into account formulae (2) and (3), expressions (14), and 
also (15), (16), and (18) considered in spherical space generalize the formulae 
for cos(a — b) and cos(a + b). Moreover, expressions (14), and also (15), (16), 
and (19) considered in hyperbolic space generalize the formulae for cosh(a —b) and 
cosh(a + b). 


3.2 Simultaneous rotation of two circles around their common center 


Let /4z be a straight line, and let O, A, and B be three points on /4z such that A lies 
between O and B. Denote by a the length of the segment OA, and by b the length 
of the segment AB. 

Let us now fix the point O (therefore, the velocity vector 0 of O at any time 
to is zero) and let us rotate /4g around O with the angular velocity w. Thus the 
points A and B move along the circles c4 and cg centered at O of radii a anda + b, 
respectively. By Principle 2, the velocity vectors v4 and Ug of the points A and B at 
any time fg are tangent to the circles c4 and cg, respectively. Denote by vy the length 
of v4, by vg the length of vg, by wa the angular velocity around A that (together 
with v4) provides the rotation of A around O along c,, by wg the angular velocity 
around B that (together with vg) provides the rotation of B around O along cg (see 
Figure 2). 

Reasoning in a similar way as we did in order to obtain equations (7)—(11), we get 
that the movement of the point A viewed from the point O is described by equation 


___ cire(a) 
a © cire(1) a 
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Figure 2. Simultaneous rotation of two circles around their common center 


Similarly, the movement of O viewed from the point A can be written as 
circ(a) 
v4 eq(a) — w4— =0, 
circ(1) 


circ(a + b) 


the movement of B viewed from the point O is described by 
iad circ(1) 


the movement of O viewed from the point B can be written as 
circ(a + b) 0 


up eq(a + b) — ap cire() 


circ(b 
vg = vaeq(b) + wa meas 


the movement of B viewed from the point A is described by 


circ(b) 


and the movement of A viewed from the point B can be written as 
v4 = vg eq(b) —@ : 
A = up eg(>) — op —— (1) 


(21) 


(22) 


(23) 


(24) 


(25) 
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Substituting (20) into (21), we get 
WA = weq(a). (26) 
Substituting (22) into (23), we get 
Op = weq(a+t+b). (27) 
Substituting (20), (22), and (27) into (25), we obtain 
circ(a) = eq(b) circ(a + b) — circ(b) eq(a + Bb). 


Although it is not used in what follows, notice that if we set p=a+b, w=b, p—w=a, 
then the previous formula can be rewritten as follows: 


circ(p — W) = cire(y) eq(W) — eq(¢) circ(y). (28) 
Substituting (20), (22), and (26) into (24), we obtain 
circ(a + b) = circ(a) eq(b) + eq(a) circ(b). (29) 


Corollary 3.2. Taking into account formulae (2) and (3), expressions (28), and 
also (29) considered in spherical space generalize the formulae for sin(a — b) and 
sin(a + b). Moreover, expressions (28), and also (29) considered in hyperbolic space 
generalize the formulae for sinh(a — b) and sinh(a + b). 


3.3 Kinematics of a right-angled triangle 


Let us consider a right-angled triangle A A BC ina plane of constant curvature K ¢ R 
with angles ZA = i ZB, ZC at vertices A, B, and C, and with lengths a, b, and 
c of the edges BC, AC, and AB, respectively. 


Lemma 3.3. Given a triangle AABC as above, then 
circ(b) = circ(a) sin(ZB) and circ(c) = circ(a) sin(ZC). (30) 


Proof. Let us consider a rotation of A ABC with angular velocity w around the ver- 
tex C (see Figure 3). 
By Principle 2, the velocity of the vertex A is a vector of length w 


circ (a) 
circ(1) 


circ(b) 
circ (1) 


orthogonal to CB. 


orthogonal 


to CA and the velocity of the vertex B is a vector of length w 
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circ(a) 
circ(1 


circ(b) 
7 circ(1) 


Figure 3. Rotation of a right-angled triangle 


Since the length c of the edge AB does not change during the rotation, the pro- 
jections of the velocity vectors of A and B on the line AB are equal, i.e. 


circ(b) ___cire(a) 
circ(1) Cire (1) 
which is equivalent to the first formula from (30). 


The same reasoning applied to AABC turning around the vertex B leads to the 
second formula from (30). 


sin(ZB), 


Lemma 3.4. Given a triangle AABC as above, then 


_ cos(ZB) 
~ sin(ZC) 


Proof. Let us slide A ABC along the line A with speed v (see Figure 4). 

By Principle 1, the velocity of the vertex B is a vector of length v eq(c) orthogonal 
to AB. 

Since the length a of the edge BC does not change during the movement of 
A ABC, the projections of the velocity vectors of B and C on the line BC are equal, 
Le. 


and eq(c) = cate’) 


40) ~ Sin(ZB) 


(31) 


vcos(ZC) = veq(c) cos (> = ZB) (= veq(c) sin(ZB)), 


which is equivalent to the second formula from (31). 
The same reasoning applied to AABC sliding along the line AB leads to the first 
formula from (31). 
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veq(c) 


A v b C v 


Figure 4. Translation of a right-angled triangle 


Lemma 3.5. Given a triangle A ABC as above, we have 


cos(ZB) cos(ZC) 


eq(a) = cot(ZB) cot(ZC) = an(ZC) antZB) 


=eq(b)eq(c). — (32) 


Proof. Let us draw a segment AD, D € BC, such that AD is orthogonal to BC, 
and leta=Z BAD. 
Applying Lemma 3.4 to the triangles AABD and AACD, we get: 


cos(ZB) _ cos(ZC) 


a sin(a)  cos(a) * 


Hence, 


cos(ZB) 
tana = ———. 
cos(ZC) 


By construction, a < 5, ZB < 5,and ZC < 4, and we can apply expressions 


tan(a) 1 


a ee, oer 
J 1+ tan?(a) me ee J 1+ tan?(a) 


in order to obtain formulae 


sin(a) = 


cos ZC cos(ZB) 


— sin (x) = ——_____.. 
Jcos2(ZB) + cos2(ZC) cos2(ZB) + cos2(ZC) 


cos(a) = 
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Therefore, by Lemma 3.4 applied again to the triangles AABD and AACD, we 


have 
cos(a) cos(ZC) 


BD) = = 22 
ae?) sin(ZB)  sin(ZB)./cos2(ZB) + cos2(ZC) 7 

ane sin(@) cos(ZB) 
eq(DC) = = (34) 


sin(ZC) sin(ZC) ,/cos2(ZB) + cos2(ZC) 


Recall that, from expressions (18) and (19) it follows that 


(s) in the spherical case 

eq(a) — eq(BD) eq(DC) = — V(eq?(BD) = 1)(eq2(DC)—= 1), (35) 
(h) in the hyperbolic case 

eq(a) — eq(BD) eq(DC) = V(eq?(BD) — 1(eq2(DC)=1). (36) 


By formulae (33) and (34), 


cos(ZC) cos(ZB) 


Sa) SBD VL) = Sa) > Saere an ZB eos ( ZB) eo ON 
(37) 


and, also, 


v (eq?(BD) — 1)(eq?(DC) — 1) 
JOR erase Zo-y 


~  sin(ZC) sin(Z B)(cos2(Z B) + cos2(ZC)) 


The angle sum in any triangle is greater than 27 in the spherical case, and is 
less than 27 in the hyperbolic case. Hence, for the right-angled triangle AABC 


with ZA = 5 we get ZC > 5 — ZB in spherical space, and ZC < 5 — ZB in 
hyperbolic space. Thus, 


(s) in the spherical case 


cos?(ZB) + cos(ZC) — 1 < 0, 


(h) in the hyperbolic case 


cos*(ZB) + cos?(ZC) — 1 > 0. 
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Therefore, formula (38) gives 


(s) in the spherical case 


— v (eq?(BD) — 1)(eq?(DC) — 1) 
_ cos(ZC) cos(Z B)(cos?(Z)B + cos?(ZC) — 1) (39) 


~  sin(ZC) sin(ZB)(cos2(ZB) + cos2(ZC)) ’ 


(h) in the hyperbolic case 


Vv (eq?(BD) — 1)(eq?(DC) — 1) 
_ cos(ZC) cos(Z B)(cos?(ZB) + cos*(ZC) — 1) (40) 
~ sin(ZC) sin(ZB)(cos?(ZB) + cos2(ZC)) 


Replacing the right-hand and the left-hand sides of equations (35) and (36) by 
expressions (37), (39), and (40), for both hyperbolic and spherical cases we obtain 
the same formula: 


eq(a) = cot(ZB) cot(ZC). (41) 


It remains to apply formula (31) from the statement of Lemma 3.4 to equation (41) 
in order to get expression (32). 


Corollary 3.6. Taking into account formulae (2) and (3), Lemma 3.5 can be read as 
the Pythagoras theorem in both hyperbolic and spherical spaces. 


Lemma 3.7. Given a triangle A ABC as above, then 


eq(a) _ circ(a) 
eq(b) _ circ(b) 


eq(a) __ circ(a) 
eq(c) _circ(c) 


cos(ZC) and cos(ZB). (42) 


Proof. By Lemma 3.5 we know that 
eq(a) = cot(ZB) cot(ZC). 
Also, by Lemma 3.4 it is true that 


cos(ZB) 


eq(b) = am ZCy" 
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Therefore, 
eq(a) = cos(ZC) (43) 
eq(b) — sin(ZB) 
We also know from Lemma 3.3 that 
1 __ cire(a) (44) 


sin(ZB)  circ(b)” 


Substituting (44) into (43), we get the first identity (42). Exchanging roles of b 
and c and also of B and C, we obtain the second identity (42). 


4 Corollaries 


Let us consider an arbitrary triangle AF GH ina plane of constant curvature K € R 
with angles 7ZF(< 2), ZG(< 2), ZH(< 7) at the vertices F, G, and H, and 
with lengths f, g, and h of the edges GH, HF, and FG, respectively. 


Lemma 4.1. Given a triangle AF GH as above, then 


circ(f) _ cire(g) __—_cire(h) 


sin(ZF)  sin(ZG)  sin(ZH)’ we) 


Proof. Let us take a point 7 on the straight line GH such that the segment F 7 
is orthogonal to GH, and let tr denote the length of F 7. Applying Lemma 3.3 to 
ATF FH, we get 


circ(tr) = cire(g) sin(ZA) 
if the point H does not lie between the points G and 7 on the straight line GH, and 
circ(tr) = circe(g) sin(a — ZH) 


if the point H lies between G and Tr on GH. Since sing = sin(z — @) for any 
gy € R, formula 


circ(tr) = cire(g) sin(ZA) (46) 


holds in both cases. 
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Similarly, applying Lemma 3.3 to ATr FG, we obtain 
circ(tr) = circ(h) sin(ZG). (47) 
Mixing (46) and (47), we get 


circ(g) — circ(h) 


sin(ZG)  sin(ZH)’ ee) 


By analogy, if we take a point Tg on the straight line FH such that the seg- 
ment G7¢g is orthogonal to FH and then apply Lemma 3.3 to the triangles ATg GH 
ATgGF, we obtain 

circ(f) __circ(h) 
sin(ZF)  sin(ZH)’ 

The equalities (48) and (49) imply (45). 


(49) 


Corollary 4.2. Taking into account formulae (2) and (3), Lemma 4.1 can be read as 
the Law of Sines in both hyperbolic and spherical spaces. 


Lemma 4.3. Given a triangle AF GH as above, then 


(s) in the spherical case the following relation holds: 

eq( f) = eq(g)eq(h) + vy (eq*(g) — 1)(eq?(h) — I) cos(ZF); (50) 
(h) in the hyperbolic case the following relation holds: 

eq(f) = eq(g) eq(h) — V(eq?(g) — I(eq?(h) — 1)(cos ZF). (51) 


Proof. Remark that Lemma 4.3 reduces to Lemma 3.5 if ZF = 4. 

Let us now consider the general situation, that is, when0< ZF < mand ZF #4 oe 

Take a point Kg on the straight line FH such that the segment GKg is orthogonal 
to FH. Let kg denote the length of GKg, g; denote the length of FKg, and go 
denote the length of KgH. 

Let us first obtain formula 


circ (h) 
circ(g) 
Applying Lemma 3.5 to AFGKg and to AHGKg, we get 


eq(f) = eq(g) eq(h) — (eq*(g) — 1) cos(ZF). (52) 


eq(h) = eq(gi) eq(ka), (53) 


eq(f) = eq(g2) eq(ka). (54) 
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Let us study three possible cases. 


Case 1. Assume that ZF < 4, ZH < 4. Therefore Kg belongs to the segment 
FH and g; + g2 = g. Substituting g2 = g — g, into (54), applying relation (14) to 


eq(g — g1), and taking into account formula (53), we obtain 


cire(gi) eq(h) 


2 — 
ciro(g) eq(gi) 4 8) ~ ea 


eq( f) = eq(g) eq(h) — 


Applying Lemma 3.7 to AF GKg we get 
eq(h) _ circ(h) 
eq(gi) —_cire(g1) 

Substituting (56) in (55), we obtain (52). 


cos(ZF). (56) 


Case 2. Assume that ZF < 5, ZH > 4. Therefore H belongs to the segment 
FKge and gi = g + gz. Substituting g2 = gi — g into (54), recalling that, by (5), 
eq(g1 —g) = eq(g — 21), and then repeating step by step what we have already done 


in Case 1, we obtain (52). 


Case 3. Assume that 7F > 4. Therefore F belongs to the segment Kg H and 


go = g + gi. Substituting it into (54), applying relation (15) to eq(g + gi), and 
taking into account formula (53) we obtain 


cire(gi) eq(h) 


2(g) —1). 57 
—Oaao ae 


eq(f) = eq(g) eq(h) + 


Applying Lemma 3.7 to AF GKg and considering that 2GF Kg = x — ZF we get 


eq(h) = an) cos(ZGFKg) = a Ste =- ane) 
eq(gi) —_circ(g1) cire(g1) cire(g1) 


cos(Z F). 


(58) 
Substituting (58) in (57), we obtain (52). 


Thus, formula (52) is proved. 


Similarly, if we take a point Ky on the straight line FG such that the segment 
H Ky, is orthogonal to FG and repeat the previous reasoning, we obtain 


ea(f) = ea(g) eq(h) — AB (eq?(hy— eostZF). 5) 
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Expressing eq(g) eq() — eq(f) from (52) and (59), we get 
circ(g) 


ene) (eq?(g) — 1) cos(Z FF) = ——(eq?(h) — 1) cos(ZF), 
circ(g) circ(h) 


. _ eq?(h) — 1 
cire(g)/ —eq?(g) — 1 
Recall that, by definition, circ(r) > 0 for r > 0. Therefore, 


circ(h) —_feq?(h)—1 
cio(g) — V eq(g)—T - 


Substituting (60) into (52), we obtain 


eq?(h) — 1 


“gat et 8) — Deo(ZF). 1) 


eq( f) = eq(g) eq(h) — 


As eq?(a) < 1 in the spherical space and eq?(a) > 1 in the hyperbolic space (see 
formulae (2) and (3)), the identity (61) implies (50) in the spherical case and (51) in 
the hyperbolic case. 


Corollary 4.4. Taking into account formulae (2) and (3), Lemma 4.3 can be read as 
the Law of Cosines in both hyperbolic and spherical spaces. 
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1 Introduction 


The Gauss—Bonnet theorem is one of the most beautiful equations in mathematics. 
It says that for an orientable closed surface S in Euclidean space R?, the “total cur- 
vature” is equal to 27 the Euler characteristic of S. That is, 


| Kdo = 27 x(S). 

S 
The left-hand side is the integral of K over the whole surface S with respect to the 
area form do; here K is the Gaussian curvature at each point on the surface. The 
Gaussian curvature is a geometric quantity which changes as you bend, stretch or 
scale S in R?. On the other hand, the Euler characteristic y(S) is a topological 
invariant, a quantity that remains constant no matter how you deform the surface. 
What makes the Gauss—Bonnet formula remarkable is that it elegantly connects these 
two very different worlds of geometry and topology. 

The above global Gauss—Bonnet theorem follows directly from the local Gauss— 

Bonnet theorem; we state a simplified version of it below: 
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Theorem 1.1 (Gauss). Let S be a surface in R? and let T = A(A, B,C) be a 
geodesic triangle in S, that is, T is a simply connected subset of S which is bounded 
by three geodesic segments connecting three points A, B,C. Then 


I Kdo =(ZA+ZB+ZC)—-1z. 


T 


Here ZA, ZB, ZC are the interior angles of the triangle. Thus, the right-hand 
side is the angle excess of T. Indeed, since T is not a planar triangle, we should 
expect its interior to have non-zero total curvature. In that case the theorem proves 
that the sum of its angles is not z, and how different it is from 2 depends on curvature 
and area of 7. Important examples are when K is constant, in which case the angle 
excess of a triangle T is a constant multiple of its area. 

Our goal is to give the reader an informal understanding of the concepts related 
to the above formula without developing the machinery for a rigorous proof which 
can be found in many textbooks. The references [1] and [4] are good sources for 
a complete treatment of the subject. In Section 2, we define geodesic curvature of 
curves on surfaces and demonstrate its relation with angles on a piecewise geodesic 
curve. In Section 3, we informally explain the Gaussian curvature and provide a 
heuristic proof of Gauss—Bonnet formulas. 

It is enlightening to know the history of how these ideas developed. Given a 
surface S smoothly embedded in R?, at each point p € S there is a tangent plane 
T,S containing all tangent vectors, and there is a unit normal vector N, » perpendicular 
to the surface. Since S is orientable there is a coherent and continuous way to choose 
N, p at every point p. Pick a unit tangent vector v € TpS. There is a unique plane P; 
containing both Ny and v, and this plane intersects S along a curve y containing p. 
Locally around p we can parametrize this curve y: [—e, €] > S such that y(0) = p, 
y’(0) = B, and such that it is unit speed. Of course the first derivative y’(t) gives 
us tangent vectors. We can differentiate again to get the second derivative y”(t) of 
acceleration vectors. Then let us define «(U) to be the dot product Nyy’ "(0). This 
quantity «(v) will be called the curvature in the v direction at p. Since the curve y 
is on the plane P35, it is easy to see that k(V) = +||y”(0)|| and the sign depends on 
whether y” (0) and N, » point to the same or opposite direction. The following theorem 
is proven by Euler [2]; it precedes Gauss’ work on surfaces. 


Theorem 1.2 (Euler). Given the above notations, if k(b) are not all the same as 3 
varies in the tangent plane, then there is a unit tangent vector 0, such that ky = 
K(01) is the minimum, and a unit tangent v2 such that kz = K(¥2) is the maximum. 
Moreover, v1 and v2 are perpendicular. 
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Here k1, kz are called the principal curvatures of S at p, and vj, v2 are the cor- 
responding principal directions. 

Gauss defined surface curvature as follows [3]. Let $? be the unit 2-sphere in R?. 
For a smooth surface S C R3, let G: S + S$? be the map taking a point p € S to its 
unit normal vector Ny. This is called the Gauss map. We have the differential map 
dGp:TyS — Tg p)8? where T,S is the tangent plane to S at p and Tsp)S? is the 
tangent plane to the unit sphere at S(p). 


Definition 1.3. Let 01, v2 be tangent vectors at p. The Gaussian curvature at p is 
defined to be 


we Area of the parallelogram spanned by S(v1), S(v2) 


Area of the parallelogram spanned by 0}, v2 


Of course this is an informal version of a more rigorous definition. We can think 
of K as the scaling factor of the Gauss map on infinitesimal area near p. 

It is then showed that the curvature K at p equals the product of the principal 
curvatures, that is, K = k,k2. This result gives us a geometric understanding of 
curvature of surfaces. At a point on S, K is positive if its principal curvatures are of 
the same sign, which is when the principal tangent curves bend in the same direction 
at p; K is negative when the principal tangent curves bend in opposite directions; and 
K is zero when one of the principal tangent curves is flat at p. Figure 1 demonstrates 
the local picture around points of positive, negative, and zero curvatures. 


(a) (b) (c) 
Figure | 


When a surface is smoothly embedded in R?, it inherits an infinitesimal metric 
which is given in coordinates by the 1 fundamental form. This metric, later known 
as the Riemannian metric—a notion of distance on the surface—exists without the 
need of embedding the surface into 3-space. Gauss formulated a more sophisticated 
description of K in terms of the 1‘ and 24 fundamental forms, which in turn can 
be computed purely using the 1“' fundamental form. This result is called Gauss’ 
Theorema egregium [3], translated by “remarkable theorem.” Indeed the theorem 
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shows that the curvature K depends only on the Riemannian metric of the surface, 
hence it is an intrinsic property of the surface and it can be computed purely by 
measurements of distances and angles on the surface. This result and the works of 
Bernhard Riemann that follows mark the beginning of Differential Geometry, a very 
important and beautiful area of mathematics. 

Assuming the Theorema Egregium, in Section 3, we will use an informal intrinsic 
definition of the Gaussian curvature to prove the local Gauss—Bonnet theorem. 


2 Geodesic curvature and geodesics 


We will now define the notion of geodesics which will be important to understand 
curves on surfaces. Let S be a smooth surface in 3-space. Consider a smooth curve 
y:|[a,b] — S of unit speed, that is, ||y’(t)|| = 1. At the point y(r), the curve’s 
tangent vector is given by y’(t) and its acceleration vector is y(t). Since the curve 
is unit speed, the acceleration vector is perpendicular to the tangent vector. (One can 
see this by moving the base point of vector y’(f) to 0, so that y’(t) becomes a path on 
the unit sphere, which makes the vector y’”’(¢) tangent to this path at the point y’(r), 
hence y’(t) L y’(t).) Now let (y/(t))~ be the plane of all vectors perpendicular to 
y(t). 

We have a natural orthonormal basis for (y’ (t))+ given by {Nya Nya x y(t}. 
Recall that NV. y(t) is the unit normal vector to the surface and N. y(t) X y(t) is the cross 
product which i is perpendicular to both Nya) and y’(t). Since y(t) € (y/(t))+ = 
span{ N,), Nv. N v(t) x y’(t)}, we have a unique decomposition 


y(t) = kn(t) Ny) + ke(t)(Nya x y'(t)) 
for kn(t), ke(t) € R. 


Definition 2.1. We define the geodesic curvature of the curve y at point p = y(t) 
to be kg(t). 


Definition 2.2. A geodesic on S' is a unit speed curve whose geodesic curvature is 0 
everywhere along the curve. 


The geodesic curvature is a real valued quantity measuring how much the curve 
is veering left or right at a given point, left and right is distinguished by the sign of 
kg(t). The curve is geodesic when informally speaking it is always going straight, 
and more precisely, it is geodesic when the acceleration vector y(t) is either 0 or 
perpendicular to the surface. 
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Ny@) 


Nya) x y'(t) / 


Figure 2 


Using this definition one can easily see that geodesics on a plane are straight lines, 
and circles on a plane has constant geodesic curvature inversely proportional to the 
circle’s radius. Indeed, suppose y is a unit speed parametrized planar circle of radius 
r, so y:[0, 2r] + R? given by y(t) = (rcos (4), r sin (£)). We can check that 


y(t) = (- sin (=).cos (-)) and y"(t)= ~( — cos (<). — sin (<)). 


We can choose Nya) so that Nya) x y'(t) = (—cos (4), — sin (£)) thus the geodesic 


curvature is kg(t) = 1. 


Note that for the case of planar circles, the acceleration vector y” (t) always points 
towards the center of the circle. Using this fact one can see that great circles on 
a sphere, i.e. circles which are intersection of the sphere with a plane through the 
center, are geodesics of the sphere. 


We defined the geodesic curvature making full use of the ambient space R?. How- 
ever, it can be defined intrinsically in a way that depends on the introduction of the 
unique Levi-Civita connection determined by the Riemannian metric. We will not 
develop this general machinery here. 
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For the remaining part of this section we will explore how geodesic curvature is 
related to angles. Given a smooth unit speed arc y: [a,b] > S, we can integrate the 
geodesic curvature with respect to arc-length: 


b 


[keds = f ke(tyat 


4 a 


Think of this quantity as the total geodesic curvature of the arc y. Now suppose that 
our surface is the R? plane, we will compute the total geodesic curvature of y as 
the change in direction of the tangent vector y’(t) as t goes from a to b. Since y 
is of unit speed, we can think of y’(t) as a parametrized path on the unit circle $!. 
For simplicity let us suppose that k g(t) > 0 on the whole curve y. (For curves with 
kg changing sign, we can always decompose into a concatenation of sub-arcs so that 
on each arc kg is either non-positive or non-negative.) Then we have 


b 


b 
/ kg (t)dt = / ly" Ollat: 


a 


The right-hand side is exactly the definition of arclength for the path y’(t) which 
starts at y’(a) and ends at y’(b). Since the arc y’(t) is on the unit circle, its length 
is © + k2m where @ is the angle between the y’(a) and y’(b) as vectors based at 0, 
and k € Z records how many times the path y’(t) winds around the unit circle. 


Remark 2.3. The total geodesic curvature of a planar curve y is equal to the total 
amount of turning (measured by angle) of the tangent vector y’(t). 


We also have the following claim which will be justified below. 
Remark 2.4. We can think of an angle as infinitely concentrated geodesic curvature. 


Consider two infinite unit speed geodesic rays a1, a2: [0, 00) — S starting at the 
same point po = a1(0) = a2(0). Let © € [0, ] be the angle between —a‘ (0) and 
a’,(0). We let a: IR — S be the piecewise geodesic curve so that a(s) = a2(s) for 
s € [0, 00), and a(s) = a,(—s) for s € (—oo, 0]. So © is the exterior angle at the 
vertex po = a(0). 

One can construct a sequence of smooth curves @1, @2,... approximating a by 
smoothing out the angle as follows. Let Uj; D U2 D --- be a sequence of open 
discs all centered at po such that the radius of U; approaches 0 as i — oo. We let 
U; = S\U; be their complements. We can construct smooth unit speed curves a; so 
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yb) 


(a) (b) 


Figure 3. (a) Illustration for Remark 2.3; (b) illustration for Remark 2.4 


that up to a reparametrization it agrees with w in U;. In other words, a; agrees with a 
everywhere except in a small disc U; around the vertex po. Integrating the geodesic 
curvature of these approximating curves we have 


[keds [keds [keds = [keds 


OC; aj NU; aN; aj NU; 


since a;  U; is where a; agrees with a which have 0 geodesic curvature. Now since 
the U;’s are getting smaller, the geometry on U; gets closer to planar geometry where 
Remark 2.3 applies. Along a;  U;, the total turning angle for the tangent vectors 
a; (t) approaches ©, so Sct kg ds — ©, and therefore 


[keas = fim [eas =0. 


It makes sense for a, a curve which is geodesic except at one point, to still have a total 
geodesic curvature measure-concentrated at po. Indeed there are points on a; M U; 
where the geodesic curvature approaches infinity as 7 increases. 

The full statement of the local Gauss—Bonnet theorem is as follows. 
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Theorem 2.5 (Gauss). Let S be a surface in R3 and let D be an open simply con- 
nected region on S whose boundary 0D is smooth except at n vertices where the 


exterior angles at these vertices are 01,..., 9p. 
n 
|| Ka = 2n — (de + [ eas) 
D i=1 aD 


where i ap Kg as is computed as a sum of integrals over the smooth segments of 0D. 


If n = 0, that is, if dD is smooth with no vertices, then the equation reduces 
to [[p K do = 2m — Jn kg ds. This simplified case can be proven by working in 
an orthogonal wv-coordinate patch and writing both K and kg in terms of the first 
fundamental form quantities F, G and partial derivatives. The formula follows after 
applying the Gauss—Green theorem. This is the rigorous approach as seen in [1]. 

Now suppose that 0D has n > 1 vertices. We can approximate D by a sequence 
of open domains D; with smooth boundary 0D; agreeing with dD on almost all the 
smooth parts and replacing the angles with short arcs of large geodesic curvature, just 
like our discussion following Remark 2.4. So as 0D; approach dD we have the total 
measure of geodesic curvature: 


n 
plim, | keds = » 6, + | kg ds. 
aD; tl aD 


lim |[ Kao = ff Kae: 
D;—-D 
D; D 


Thus theorem 2.5 follows from the case where dD is smooth. 


Moreover, 


3 The Gauss—Bonnet theorem 


One thing we did not explain was how in a coordinate patch, the Gaussian curvature 
can be expressed as the exterior derivative of geodesic curvature—an essential step 
to prove the Gauss—Bonnet theorem. To explain this we will define the curvature in a 
different way, hoping to informally justify the Gauss—Bonnet theorem with a different 
perspective. 
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Suppose P is a geodesic polygon on S with k vertices. We only consider the case 
when P bounds a simply connected interior. Let ¢1,...,% be the interior angles 
of P. We can consider following quantity representing the angle excess of P: 


k k 
8(P) = )) bi — (Kk —2)m = 2 — V1 fi). 
i=1 i=1 
Note that (k — 2)z is the sum of interior angles of a Euclidean k-gon, and 27 is the 
sum of exterior angles. If we “add” a triangle T next to P (see Figure 4), we obtain 
a new geodesic polygon Q of k + 1 vertices. Let a1, @2, «3 be the interior angles of 
T and f1,..., Bx+1 be the interior angles of Q. 


Figure 4 


Remark 3.1. Angle excess is additive, that is, the angle excess of the new polygon 
Q is the sum of angle excesses of P and T: 


k+1 k 


3 
8(0) = )> Bi - kD = Di — K-20 + Doi — = 8(P) +8(7). 


i=l i=1 i=1 


Let p be a point on the smooth surface § C R?. Let T = A(A, B,C) bea 
geodesic triangle on S. We now define the curvature at p to be 


. o(T) . (ZA4+2B42ZC)-a2 
K(p) = lim ——— = lim ———_—— 
Tp Area(T) Tp Area(T) 
The limit is taken over smaller and smaller geodesic triangles T containing p in a 
way such that the vertices A, B, C approach p. We will take for granted that this limit 
exists not depending on how the vertices approach p, and that it varies continuously 
as p varies in S. This is an intrinsic definition of the curvature because measurements 


of angles and area are taken on the surface S without refering to how S is embedded 
in R?. 
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Theorem 3.2 (Gauss). Let S be a surface in R? and let T = A(A, B,C) be a 
geodesic triangle in S, that is, T is a simply connected subset of S which is bounded 
by three geodesic segments connecting three points A, B,C. Then 


| Kdo =(ZA+2ZB+4+ZC)—-n. 


T 


Informal proof. Since the closure of T is compact, for any € > 0, we can divide T 
into n sub-triangles 7¢,1,..., Te,, such that each T,; is of diameter less than €. Here 
n depends on € by this relation n ~ (diameter) )? 

We choose a point p-,j in each T,,;. Applying Riemann integration using the 
sub-triangles T,,; we have the following heuristic argument 


I K do = lim » (K(pe,i) Area(Te;)) 
i an 


n 


: 6(Te i) 
=] ———— Area(T. ; 
nan s Area(T7¢,;) ea Tea) 


i=1 


= 6(T) = (4A+ ZB +ZC)-2z (by Remark 3.1). 


Therefore we obtain the local Gauss—Bonnet formula. The above argument works 
not only for triangles but for geodesic polygons in general. 


We have the global Gauss—Bonnet immediately after the following elegant proof. 


Theorem 3.3 (Gauss). Let S be a smooth closed surface then 


I K do = 22x(S). 
Ss 


where y(S) is the Euler characteristic. 


Proof. We give S a triangulation by geodesic triangles 7;. Suppose that the number 
of vertices, edges, and faces are V, E, F respectively. We have y(S) = V—E+ F by 
Euler’s famous formula. This is a topological invariant of the surface S not depending 
on the triangulation. Since each triangle has three edges, we have 3F equals the total 
number of edges double counting each edge twice, because each edge belongs to two 
faces. So3F = 2E, which implies y(S$) = V—-E+F=V— f. 
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Let B®, © ps be the interior angles of triangle 7;. Now applying Theorem 3.2 


we have 


ce = >) Kdo = 573171) 


i=1 i=1 


= Sie? + By) + BS? —7) 


i=1 


= re? +6 ®) is p) wee 
i=1 


=2nV —Fr 
= 27x(S). 


Theorem 3.2 can be applied similarly to obtain a Gauss—Bonnet formula for sur- 


faces with cone points. 


Corollary 3.4. Let S be a surface with k cone points and cone angles 6,,..., 9, 
then 


[ea = 2nx(S)+ 6 ier): 


i=1 
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Chapter 9 


On the non-existence of a perfect map 
from the 2-sphere to the Euclidean plane 


Charalampos Charitos and Ioannis Papadoperakis 


Let E” be the Euclidean plane and let S? C E? be the 2-dimensional unit sphere 
equipped with its usual metric. With respect to this metric, which will be referred 
to as the angular metric, the distance of two points p,q € S? is the length of the 
smallest arc of the unit circle C C S?, with p,q € C. 

A smooth map f from S? or from a part of S* to E? will be called perfect if for 
each p € S? there is a neighborhood U(p) of p in S? such that the restriction of 
Ff on U(p) preserves distances infinitesimally along the meridians and the parallels 
of S? and, furthermore, f preserves angles between meridians and parallels, where 
these angles are measured infinitesimally as it will be explained before Lemma 2. 

A local isometry f: X — Y between two metric spaces X and Y is a map which 
preserves distances locally; that is, foreach p € X there exists a neighborhood U(p) 
of p in X such that the restriction of f on U(p) preserves distances. 

In this chapter, we rewrite a proof of a result of Euler which affirms that a perfect 
map does not exist. Notice that the non-existence of perfect maps was known from 
antiquity although a precise definition was never given. A perfect map was under- 
stood as a map from the sphere or from a part of the sphere into the Euclidean plane 
which respects the basic features of the terrestrial globe, namely, the size, the shape 
and the area. Consequently, it was a common assumption that it was impossible to 
find a single map preserving at the same time all these features; in other words, the 
distortion of geographical maps was an unavoidable problem and it was assumed 
that cartographers should decide which properties to preserve at the expense of oth- 
ers (see [4], Chapter 2). Even in Euler’s paper [1] the definition of a perfect map is 
not explicitly written and one has to read between the lines. In the present paper, the 
proof of non-existence of a perfect map is along the lines of Euler’s argument. It is 
written in modern language and it is independent of certain geometrical considera- 
tions present in Euler’s original paper. For instance, Euler uses in an essential way 
the fact that the vertices p,q,s,r of a quadrilateral of S 2. whose sides consist of 
arcs belonging to meridians or to parallels, are mapped via a perfect map, to points 
P, Q,S, R respectively, which form a quadrilateral of E. 
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Remark 1. It is worth noting that Euler’s theorem was published in 1777, the year 
that Gauss was born, and is part of a longer paper [1] concerning the problem of 
cartography and the construction of plane geographical maps. Gauss proved in 1828 
his famous “Theorema Egregium” which states that, if Sy and Sz are two surfaces and 
P, 4 are points in S; and S> respectively such that the curvature at p is different from 
the curvature at g, then there is no local isometry f from S; to Sz with f(p) = q. 


Using the Riemannian metric of the unit sphere S? (see formula (1) below) we 
can prove that a perfect map from S$? to E? is a local isometry. Therefore, Euler’s 
theorem may be considered as a special case of the theorema egregium. On the other 
hand, one may show directly that a local isometry from S* to E? does not exist 
because there is no way to map a spherical triangle isometrically onto a Euclidean 
triangle. Euler knew this result. For example in [1] he writes that the non-existence 
of a perfect map f: S* > E? is a well-known result. However, he gave in [1] anew 
proof for the non-existence of f through admirable computations, avoiding spherical 
trigonometry. The basic idea of the proof is to translate geometrical conditions to a 
system of differential equations which does not have a solution. Euler’s method is 
very fruitful and can be applied for the proof of similar results. For example, it is not 
hard to prove the following generalization of Theorem 1: there is no map f from S? 
to E? which preserves distances infinitesimally along the meridians and the parallels 
of S? and which sends the angles between meridians and parallels to a constant angle. 

Finally, it is shown by the authors that the hypothesis that a map / preserves 
distances infinitesimally along the meridians is weaker than that of preserving the 
distance between any two points on them. This will be done by means of Proposi- 
tion 5. 


Below we will present first a proof of the following theorem, proved in [1] (see [2] 
for the translation of [1] in English). 


Theorem 1. There does not exist any smooth map ® from a neighborhood of the 
sphere S* to a neighborhood of the Euclidean plane E? which is perfect. 


We consider a parametrization of the Eastern hemisphere So of S? given by the 
map 
(-$4)(-$9) 8 
a ge: 22 : 


p(t,u) = (cos(t) cos(uz), sin(t) cos(z), sin(u)). 


with 


Thus, the w-curves are semicircles lying on a meridian of S? and are geodesics of S?, 
while the f-curves are semicircles parallel to the equator of S? and are not geodesics. 
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With respect to the coordinates (f, uw), the Riemannian metric on S 2 has the form 


ds = V du? + dt? cos? u. (1) 


This formula was obtained by Euler as a corollary of one of his constitutive princi- 
ples of spherical geometry. This principle is contained in [3] and states that infinites- 
imally, spherical geometry behaves like Euclidean geometry. Obviously, the distance 
between two points p, q € S? defined by means of (1), coincides with the angular 
distance. The Riemannian metric ds will not be used in the calculations below. 

We denote by |p; — p2| the angular distance between the points p;, p2 € S* and 
by || Pi — P2|| the Euclidean distance between the points P;, P2 € E?. 


Let t,dt,t + dt,u,du,u+du € (- an Z), du # 0, dt # 0, where dt, du 
are considered as finite variations of f, u respectively. Then, see Figure 1, we set for 


short 


p(t,u)=p, plt,ut+du)=q, p(t+dt,u)=r. 


Figure 1. A parametrization of the Eastern hemisphere So with three points, p(t,u) = p, 
p(t,u+ du) = q,and p(t + dt,u) =r 
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Now, the parallels and the meridians of S? intersect each other perpendicularly, 
as curves. But since the parallels are not geodesics of S?, the angle Gpr of the spher- 
ical triangle pqr is not right. For this reason, Euler measures the angles between 
meridians and parallels infinitesimally. That is, the angle between the meridian and 
the parallel at the point p is by definition equal to 

lim qpr. 
pager 
The following lemma asserts that this angle is right and this explains the condition 


imposed in the definition of a perfect map, where the right angles between meridians 
and parallels are preserved. 


Lemma 2. We have 
jana. 
oa 3 


Proof. Letn = (0,0, 1) be the north pole of S?. Considering on S? the spherical 
triangle T, whose vertices are n, p and r (see Figure 1), by Girard’s theorem, the 
area E of T is equal to 


E=1rp+ipnt+ par—nz. (2) 
We also have 
arp = Frpn = rpq. (3) 


On the other hand, pnr = |dt| and eeu E = 0. Therefore, from relations (2) and (3) 
t—>0 


we obtain 


1 a 
lim gpr = lim ~(E — pnr) = =. 

ar ee PMG 

We also need the following lemma. 
Lemma 3. We have 
in (ir — ) = |sin gat) cost 

sin (SF — Pl) = | sins cos(u). 
Proof. The points p, r belong to a parallel circle C of radius cos u. Therefore these 
points define a chord in C of length 2| sin ( sd t) | cos u. On the other hand, the points 


p,r belong to a geodesic circle Cg of S, i.e. a great circle of S”, and thus the chord 
joining them has length equal to 2 sin (4\r - pi). Therefore our equality follows. 
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Lemma 4. We have 
ir pi 
dt—>0 |dt| 


= cos(u). 


Proof. From Lemma 3 we have 
in(51r— pl) = |sin (5a cost 
sin (—|r — = | sin(— cos(u). 
oe 2 
Therefore, 
- r=pl. s|r—p| | sin ($dz)| cos(u) 
dt>0 |dt| dt>o sin (4|r — pl) 4\dt| 


= in 2 pm [8 4 
w>0sin (5w) ato 5ldt| 


s 


cos(2) 


= cos(u). 


Remark 2. Lemma 4 and the fact that spherical geometry is infinitesimally Euclidean 
implies formula (1) for the Riemannian metric ds on S?. 


Proof of Theorem |. We assume that ® is a perfect map defined in a neighborhood 
of p and we will get a contradiction. 
We set 


@(p(t,u)) = P(t,u) and P(t,u) = (x(t,u), y(t,u)). 
For short, we set 
P(t,u)=P, P(t,u+du)=Q, P(t+dt,u)=R, 


x(t,u)=x, y(t,u)=y. 


The existence of the perfect map ® implies that 


lq — pl .. IQ@—P 
oS in 3 
duro |du| du>o |du| ©) 
Ir — p| _ ||R— Pll 
= img 4 
dio |dt| dito |dt| oe 


lim gpr = lim OPR. (5) 
dt—0 dt—0 
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On the other hand we have 


Ig Pl 4 
duo |du| , 
|Ie@—P| dx\? | (dy? 
1 = /(— mae 
ies (aul cy) Ga 
|R— P| Ox \2 dy\2 
aes0 lal Ge se) 
From Lemma 4, 
Ir— p| _ 
Py mee 


and from Lemma 2 and relation (5) 


lim cos(OPR) = 0. 
dt—0 
This implies that 


(PO, PR) 
||P — QI|||P — RI 
(PO, PR) 
||P — Q||||P — RI 
_ [(P(t,u + du) — P), (P(t + dt, u) — P))| 
iim = lim 2H+.1_W—— 
du>0 dt>0 ||P(t,u+ du) — P||-||P(t + dt,t)— P|| 
(x(t,u+du)—x, y(t,u+du) y) (x(t+dt,u)—x, y(t+dt,u) 2) 
dt 


a 
pe ao] G(,utdu) ERIE Y) | | Seated) —W) | 


_ MCG. ie). GF. ae) 
I (Si ae) GE. ar) 


(55 + trae) 


(BY + (RY VGr + 


where (-, -) denotes the Euclidean dot product. 


0= 


ie a 


lim 
du—>0 di se 


On the non-existence of a perfect map 


Hence, from (3)-(5), we obtain the following relations: 


3 3 
() +Ga) =» 


+ BY = ow 


Ot at 
dx dx dy dy _ 0 
du ot dudt — 


Now, from relations (6) and (7), there exists (¢, @) so that 


=) = (cos(¢), sin(@)), 
(=. ~) = (cos(u) cos(w), cos(wz) sin(w)). 
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(6) 


(7) 


(8) 


Using relation (8), we deduce that a = $+ ¢ org = 3x + @. If we assume that 


o= 5 + @ (the other case is treated similarly), we obtain 


(>. =) = (cos(?), sin(p)), 
(>. 2) = (—cos(u) sin(¢), cos(u) cos()). 
Therefore, since ; ; 
0 r) 
dudt  otdu 


Relations (9) and (10) imply that 
- ae =— ita cos(u) + sin(@) sin(2), 
ot du 


cos(g) =— sin(g) cos(u) — cos(@) sin(). 


Solving the system of equations (11) and (12) we obtain that 


“e =0 and a = —sinu. 
Hence, 
_ Pb Pb | 
~jy One 


which gives a contradiction. 


(9) 


(10) 


(1) 


(12) 
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Now in the spirit of Euler’s method we will prove the following proposition. 


Proposition 5. There does not exist a smooth map WV from a neighborhood U of S* to 
E? which preserves infinitesimally the distance along the meridians and the parallels 
of S* and which sends the meridional arcs of U N S* to straight lines of E?. 


Proof. We assume that such a map W exists and we will get a contradiction. Let 
W(t, u) = (x(t, uv), y(t, u)), where (f, uw) are the parameters introduced above. Since 
W preserves infinitesimally the distance along the meridians and the parallels of S?, 
we have from relations (6) and (7) in the proof of Theorem | that there are variables 
@ and w which are functions of ¢, u such that 


(=, 2) = (eos(g).sin(@)), 
(=. ~) = (cos(u) cos(@), cos(u) sin(@)). 
Since 42 42 
dudt  dtdu’ 
we have 
— sing = — sin(u) cos(@) — cos(u) sin(o) 2. (13) 
u 
cos() = — sin(w) sin(w) + cos(u) cos(o) 22. (14) 
u 


Multiplying (13) with cos(@) and (14) with sin(@) and adding, we have 


(— sin(#) cos(w) + cos(¢) sino) = —sin(u) cos?(w) — sin(u) sin?(@) 
(15) 
<=> sin(¢— wat = sin(u). 


Multiplying (13) with cos(@) and (14) with sin(@) and adding, we obtain 
0 = —sin(u) cos(@) cos(#) — cos(u) sin(o) 2 cos(¢) 
— sin(u) sin(@) sin(@) + cos(uv) cos(o) 32 sin(?) 
which implies that 


cos(u) sin(d — we = sin(u) cos(¢ — w). (16) 
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On the other hand, since the meridians are mapped to straight lines, we have that 


dp 
a = 0 17 
Au (17) 
Differentiating relation (15) with respect to u, we have 
dw\ 0 0? 
cos(¢ — w) (= = 5) <e + sin(@ — we = cos(u) 
hence, from (17), 
dw 0 
—cos(g — 0) = cos(u). (18) 
Multiplying (16) with 
dw 0d 
du ot 
we have 
: dw@\2d0¢ : dw 0d 
cos(u) sin(¢ — w) (=<) 3 = sin(u) cos(¢ — Ons, 


Hence, using (15) and (18), 
cos(u) sin(z) (=) = — sin(u) cos(u) 


0 2 
=> sin(u)(1 + (=) ) = 0 
du 
<= sin(u) = 0. 
But this last equality cannot happen in an open subset of the sphere. Thus we get a 
contradiction which proves Proposition 5. 
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1 Introduction 


The problem of cartography, that is, the problem of constructing the mappings from 
the sphere S? (or from a subset of $*) to the Euclidean plane E? which are “best” in 
some sense is an ancient problem, and several prominent mathematicians have been 
studying it from antiquity to our days. The word “best” refers to mappings which 
present the least distortion with respect to distances, angles or areas, or a combination 
of these. 

In the sequel, by a mapping from S? to E? we mean a map defined either on the 
whole sphere S? or, on a subset of S?. 

In 1777, Euler published three memoirs on cartography: 


(i) “De repraesentatione superficiei sphaericae super plano” (On the representation 
of spherical surfaces on a plane) [2]; 


(ii) “De proiectione geographica superficiei sphaericae” (On the geographical pro- 
jections of spherical surfaces) [3]; 
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(iii) “De proiectione geographica Deslisliana in mappa generali imperii russici usi- 
tata” (On Delisle’s geographic projection used in the general map of the Russian 
empire) [4]. 


In the first memoir, he studied the following three kinds of maps: 


(1) maps where the images of all meridians are perpendicular to a given horizontal 
axis and all parallels are parallel to it; 


(2) maps which are conformal; 


(3) maps which preserve area and such that the images of all meridians intersect the 
images of the parallels at right angles. 


All these maps considered by Euler will be referred to as geographical projections. 

Two years later, Lagrange published two memoirs on cartography [6, 7] in which 
he observed that there exist infinitely many, essentially different, angle-preserving 
(conformal) projections from S$? to E? in the following sense: any two are not equal 
and they remain so after a composition by a similarity of E?. Subsequently, Lagrange 
dealt with the problem of finding the minimal deviation map amongst all these con- 
formal projections. To that end, he defined at each point a magnification ratio, which 
is a deviation factor from conformality at this point. 

In the nineteenth century, the French mathematician and cartographer Nicolas- 
Auguste Tissot made a thorough investigation of geographical projections from S? 
to E? and he studied several of their distortion properties. Following his ideas [11], 
we shall recall the definition of the distortion Dy(p) of f at a point p € Sj, see 
Definition 2.2 below, where f: 5S; — Sz is anon-conformal map of class C 1 between 
two Riemannian surfaces. After the work of Tissot, the term guasiconformal map was 
generated and the problem of existence and uniqueness of the best quasiconformal 
map between Riemann surfaces was revealed as an important and difficult problem. 

For the origin and development of cartography and its relation with quasiconfor- 
mal maps, we refer the reader to the paper by Papadopoulos [9] which inspired the 
author to write the present chapter. The paper [10], also by Papadopoulos, contains 
biographical material on Tissot, and a review of some of his main ideas. We note in- 
cidentally that both Grétzsch and Teichmiiller, who are usually referred for the early 
work on quasiconformal maps, refer to Tissot; cf. the history in [9] and [10]. 

We now explain the results. 

The sphere S? is parameterized by the map 

us 


&:(t,u) € [—2, x] x [- >: =| —> (cos(t) cos(u), sin(t) cos(u), sin(u)). (1) 


For ¢ fixed, the u-curves &(t,-) are the meridians of S? and are geodesics for 
the spherical metric. Setting u = 0, the curve &(-,0) is the equator of S* while 
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the ¢-curves &(-,u), u # O are circles parallel to the equator and they are called 
parallels of S*. The parallels are not geodesics. The circle &(0,-) will be called 
prime meridian. The pair (t, u) define the geographical coordinates of $7, namely f 
is the latitude and u is the longitude of S?. Thus, the equator and the prime meridian 
serve as axes of S* with respect to which the geographical coordinates are measured. 

In what follows, we always assume that & is restricted to the open set (—z, 1) x 


(- a z) because on this set € is one-to-one. We shall denote by So the subset 
&((—z, It) Xx (- oe z)) Cc S?. By means of &, each point e € So with e = &(t,u) 


will be identified with the pair (f, uw) if no confusion is likely. 

A differentiable map f(t, uv) = (x(t,u), y(t, u)) from Sp to E? is called regular 
if (=, a) # (0,0) and (2, a) # (0,0). We denote by Q the class of all regular 
maps of class C’, r > 2 from So to E 2 which preserve area and which send all 
meridians and parallels to perpendicular curves of E7. In fact, the elements of Q are 
all the geographical projections which satisfy the third hypothesis in Euler’s work [2] 
to which we referred to above. It is not hard to see that the elements of Q cannot be 
conformal maps. 

If f(t,u) = (x(t, u), y(t, u)) € Q, we set 


me (t,u) = (=) + (2). (2) 


One can observe that m(t, w) is the length in E? of the tangent vector to the curve 
F(t, -). We will be writing m(t, u) if there is no confusion on the map /. 

Finally, there is a simple area-preserving geographical projection fp € Q which 
was introduced and studied by Lambert in [8, Section 6] (see also [12], pp.76—-85). 
This projection, known as Lambert’s cylindrical equal area projection, was redis- 
covered by Euler in [2]. It is defined by the formula 


fo(t,u) = (ft, sin(v)). 


The goal of this note is to prove that the distortion D ,(t,u) of fo has remarkable 
extremal properties with respect to the distortion of the maps f € Q, namely, for 
every ¢ and u fixed the value D ,(t, u) appears either as a maximum or as a minimum 
of the values {Dy (t,u), f € Q}. 

We have the following theorem which permits us to estimate the deviation from 
conformality of the mappings f € Q, where the conformality structure on S? is 
induced from the standard Riemannian metric ds = \/cos?(u)dt? + du? of S?. 


Theorem 1.1. /f f(t,u) = (x(t, u), y(t, u)) € Q, then for each (t,u) € So, 
(i) Dg (t,u) < De(t,u) if m¢(t,u) = 1 or me (t,u) < cos(u); 
(ii) Dg (t,u) = Dy (t,u) if cos(u) < me (t,u) < 1. 
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In the last paragraph, following Euler’s method, we will find explicit maps f € 9 
and we will show that some of them satisfy the inequality my(t,u) < cos(u) for 
each (t,u) € So. Therefore, the distortion of these maps is bounded below by the 
distortion of fo for each (t, u) € So. 


Remark 1.2. Tissot in [11, Chapitre troisiéme] studied concrete geographical pro- 
jections which preserve area and which he referred to as “projections authaliques” 
(see [11], Ne 70). He made a thorough analysis of these projections and he calculated 
their distortion, in the sense of Definition 2.2 below, at certain points of the sphere 
including the equator and the poles. Using this, Tissot compiled numerical tables 
containing, among other things, useful information about the distortion of these pro- 
jections (see the Complement in [11]). However, since all projections considered by 
Tissot are contained in the set Q, Theorem 1.1 can be used to find estimates of their 
distortion which become quite accurate near the poles and the equator. For instance, 
for any map f € Q, the distortion Dy(t,u) goes to infinity (resp. to 1) as (¢, u) 
approaches the poles (resp. the equator) provided that Dy(t, u) is bounded below 
(resp. above) by Dy, (t, u). 


2 The distortion of area-preserving geographical 
projections 


Let A be a linear isomorphism from E* to E? which is not conformal and let us 
denote by || - || the Euclidean norm. Let C, be the circle of radius r > 0 centered at 
the origin O. It is well known that each C, is sent by A to an ellipse L; centered at O. 
Furthermore, there is a unique pair v1, v2 of perpendicular unit vectors of E? such 
that Av,, Avz are perpendicular. This result was proven by Tissot in ([11], Chapitre 
Premier, Ne 3) by a simple continuity argument. Below, we reprove it in a geometric 
way. 

The following proposition summarizes basic and nice properties of A which are 
used below. 


Proposition 2.1. (1) All the ellipses L; are homothetic with homothety center O. If 
a; is the great axis of L, and b, its small axis, then a,/b, = a,/b; for each r. 

The ratio a, /b, is called the distortion of A. 

(2) There is a unique pair v4, v2 of perpendicular unit vectors of E? such that 
Avy, Av2 are perpendicular. Thus, if |\vi|| = ||v2|| and if ||Avy|| > || Ave|] then 
a1/by = ||Avi||/||Avall. 

(3) A and A! have the same distortion. 
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Proof. Let a1, b; be the great and small axis of L respectively and let w1, w2 be nor- 
mal vectors in the direction of the great and small axis of L, respectively with || w || 
= 4}, ||W2|| = by. Let v1, v2 be unit vectors such that A(v1) = wy and A(v2) = wo. 
Obviously || w1|| < || A(v)|| < || we|| for each unit vector v ~ v1, v2. From the linear- 
ity of A we have also || A(Av1)|| < || A(Av)|| < ||A(Avg)]| and | AQv1)|I/|AQv2) 
= ||w1||/||w2|| = a1/b1. Thus statement (1) follows easily. 

In order to show that the vectors v; and v2 are perpendicular, we consider a circle 
C;, (resp. C7) centered at the origin and tangent to L at the two points where the great 
axis (resp. small axis) intersects L;. Obviously C; is contained in the interior of Ly 
and L, is contained in the interior of C/. Let Kg (resp. Ky) be the image of C/ (resp. 
C;,) by A~!. Then Kg, Kp are homothetic ellipses centered at the origin which must 
be tangent to C; at the endpoints of the vectors vj, v2. This implies immediately 
that v;, v2 are perpendicular and unique and thus statement (2) is proven. Finally 
statement (3) follows easily from (2). 


The above properties of linear isomorphisms of E? give rise to the following 
definition of the distortion of a diffeomorphism of class C! between Riemannian 
surfaces, i.e. surfaces equipped with Riemannian metrics. 


Definition 2.2 (distorsion). Let $;,S2 be two Riemannian surfaces and let f:S;—> S»2 
be a diffeomorphism of class C!. At each point p € Sj the differential df, between 
the tangent spaces 7,5, of S; and the tangent space Ty (p)S ? sends the unit circle 
C of 7,51 centered at the origin to an ellipse L = df,(C). Denoting by a and b 
the great and the small axis respectively of L, the distortion Dy¢(p) of f at p is the 
distortion of the linear map df, and is defined as the ratio a/b. 


In Definition 2.2, if df, preserves the angles then D¢(p) = 1 since a = b. Thus, 
if f is conformal then D¢(p) = 1 for each p € Sj. 
The real function D¢: S; — R will be referred to as the distortion function of f. 
As a special case, let f: Q — f(Q) be a C!-diffeomorphism between the plane 
domains Q and f(Q). Let z = (t,u) = t + iu bea coordinate system on Q and 
let w = f(z) = x + iy bea coordinate system on f(Q). If we consider on Q and 
f (2) the standard Euclidean metric, the distortion of f at z9 € Q may be defined as 
previously and will be denoted by D 'f (Zo), as a reminder of the fact that it is defined 
with respect to the Euclidean metric. 
The following well-known formula can be found in many books (see for exam- 
ple [5], p. 2) 
Bp(zo) = eel + LfeCeoN 
| fz(Z0)| — | fz(Zo)| 


where fz = 1/2(f: —ifu) and fe = 1/2(ft + ifu). 


(3) 
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Obviously, if S;, Sz are Riemann surfaces and f is a C 1 diffeomorphism be- 
tween them, the distortion D;(p) can be defined for each p € S;. This can be done 
by covering S; (resp. Sz) by an atlas of charts which defines the Riemann structure 
on Sj (resp. on S2). 

First, we will calculate the distortion function of the map 

um 

(-1,27)x(-+,= 

f:(-m,2)x (5,5 

given by formula (1). The differential d&(,) of the map € sends the vectors vj = 
(1,0) and v2 = (0, 1) of the tangent space T(,,) E” to the vectors 


) cB? So, 


ry = (—sin(t) cos(wz), cos(t) cos(u), 0), 
rz = (—cos(t) sin(u), — sin(t) sin(z), cos(2)) 


respectively, of the tangent space T¢(¢,4)S” C E%. If we denote by (-,-) the Euclidean 
dot product, we have 


(r1,72) =9, [Ir] = cos(u), — [Ir2l| = 1. 


This implies that the unit circle C in Ti.) E? is sent to an ellipse L = d&yy)(C) in 
Ts(t,u)S7 whose great axis has length 1 and the small one has length cos(u). There- 
fore, the distortion Dg(t, u) of the map & at (t, uv) is 1/ cos(u). Thus, if we consider 
the inverse of the map &, say 1 = &7!: So > (—a, 2) x (— oi Z), and if e = &(t,u), 
we have ; 

Dye) = Sash) (4) 
since from Proposition 2.1 the distortion of d&(,,) is equal to the distortion of die, 
where e = &(t, uv). 

Now let U = (—a,2) x (— %,%) and g:U > f(U) C E? be C!-dif- 
feomorphism such that the differential dg(;,) sends the unit vectors v; = (1,0), 
v2 = (0,1) of T¢)U to perpendicular vectors, say w1, w2 in Titer: Set 
G=gon: So > E’. 

We have the following lemma. 


Lemma 2.3. Let e = &(t,u) € So. The distortion Dg(t, u) of G ate is given by the 
product 


De(t.u) = Dg(t,u). (5) 


1 
cos(u) 
Proof. Since dye sends the vectors r1, r2 of TeS? to v1, v2 € TouyU and dgiu) 
sends v1, V2 to the perpendicular vectors w1, w2 in Tg(t¢,y) E 2 the proof follows easily 
combining relation (4) and statement (2) of Proposition 2.1. 


Area preserving maps from the sphere to the Euclidean plane 141 
Lemma 2.4. Let fo: So > E? be the projection fo(t,u) = (t, sin(u)). Then 


Dy (t,u) = (6) 


cos2(u) 
Proof. The differential dfo,,,,) is expressed by the matrix A = (a eee) Since 
A(1,0) = (1,0) and A(0, 1) = (0, cos), we have from statement (2) of Proposi- 
tion 2.1 that Dyol(t, u) = 1/cos(u) and from Lemma 5 that Df, (¢, v) = 1/ cos*(u). 


3 Proof of the main theorem 


Let f: So > E*, f(t, u) = (x(t, u), y(t, u)) be a geographical projection belonging 
to the set Q. We recall that f is a regular, area-preserving map of class C’, r > 
2, which sends the families of meridians and parallels to families of perpendicular 
curves in E?. 

We start with the general formulas 


dx = pdu+qdt, dy=rdu+sdt, (7) 


where 
Ox ax oy oy 
=—, =, r=->—, s==—. 
Pu OF Ou ai 
Since the images of meridians and parallels are cut perpendicularly, pg + rs = 0 


(see [1] or [2]). Therefore, there is a function n(t, u) such that 
S=—np, g=nr. (8) 


Since f is regular, n(t,u) 4 0 for each (t, uw). Therefore, we may assume that n is 
positive. In fact, if n is negative we may work similarly. 
From (7) and (8) we have 


dx = pdu+nrdt, dy =rdu—npat. (9) 


Following an idea of Euler, we consider the points e = (t,u), e; = (¢ + dt,u), 
é2 = (t,u + du) in So and we set 


E=f(t,u), FE, = f(tt+dt,u), Fo= f(t,u+du). 
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We have, as in [1], 


im lea = € = 1 
du>0 |dul , 
in _ (=) (zy = per. 


Therefore, the projection f is area-preserving if and only if 


n(p? +r7)|du||dt| = cos(u)|dul|dt| 


or 
cos(2) 
= : 10 
From (9) and (10) we have 
r cos(u) pcos(u) 
Be BOWS aaa NE SP 


To simplify the calculations, we set 
p=mcos(¢?) and r=msin(¢), 
where 
ax \2 dy \2 
= — — > 0, 
ae ( Ou ) 7 ( du ) 
and we have 


dx = mcos($)du + 


£280) SO) ay, dy = msin($)du 


cos(u) cos(¢) 
—_ — 
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Setting m(t,u) = k(t, u) cos(u) we have k(t, u) > 0 for each (t, u) and the relations 
above take the form 


sin(¢) d 


cos(#) 
7 ——adt. 


(1) 
We may verify immediately that a solution of (11) is obtained by setting k = 1, 
= 5. Thus we have 


dx = kcos(u) cos(@)du + t, dy =kcos(u) sin(d)du — 


x(t,u)=t, y(t,u) = sin(u) 
which gives Lambert’s equal area projection. 


Remark 3.1. Setting k = a, ¢ = @o, where a, po are arbitrary constants, the 
formulas 


» y(t, u) = asin(u) sin(go) — 


X(t,u) = asin(u) cos(g~o) + ani) t cos(Yo) 


determine a more general solution of (11). This solution differs from the previous one 
only in that the meridians are no longer normal to the x-axis. Actually, it is not hard 
to show that if k is constant in the relation (11) then the angle ¢ must be constant. 


Now we can prove Theorem 1.1, which is restated below. 
Theorem 1.1*. For each f € Q, 


(i) if mg(t,u) = 1 or me(t,u) < cos(u), then 


1 
Dy (t,u) = Son) < Dy (t,u); 


(ii) if cos(u) < mg (t,u) < 1, then 


1 
D f(t, u) = cos?(u) > Dy (t,u). 
Proof. From (11) we have 
Ox Ox __ sin(¢) 
FO) = k cos(u) cos(¢), > 
ay : dy _ _cos() 
an k cos(u) sin(@), vi a 
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Therefore, we may compute immediately the Jacobian of f and verify that it 
sends the tangent vectors v; = (1,0) and vz = (0, 1) of (Ty to the perpendic- 
ular vectors w1 = (sin (£), — cos (£)) and w2 = (kcos(u)(¢), k cos(u) sin(d)) of 
Ty(t,u)E*. Thus Lemma 2.3 can be applied and we have 


Dy (t,u) = De(t,u). 


1 
os(u) 


In order to compute the term D f(t, u), we use formula (3) and the formulas 


AQw) = fe = 5h -it = (2 +2) +i(2-S)), cam 
rose ha hiesine (e243) am 


Therefore the equality 


= 5 3(—2 sin(?) + k cos(u) sin(g)) + i( _ ao — kcos(u) cos($))) 


implies 


lfz\= ; (ae + k cos(u) sin(g)) + (- ao —kcos(u) cost) 
= ; a + k? cos?(u) + 2cos(u) 
a ; (; + kcos(u)). 
1)1 
— s\z + k cos(u)}. 
So, 
A(t ae k(t : k(t 0. (13 
[ACI = 5 | ay tH w) cos(u)| = 5 (eaay tH eostu)) > 0. (130 
Similarly, we may prove that 
1 
|B(t,u)| = 5 E(t.) —k(t,u) cos(u)| (13b) 
We also have 
1 < 1/Vcos(u) < ——. 


These quantities separate the real line into four intervals. Therefore, we distinguish 
the following cases, depending on which interval the value k(t, u) is located in. 
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CASE I: k(t, u) => 1/./cos(u). This is equivalent to 


m(t,u) = k(t,u)cos(u) > Vcos(u). 


Then, (k(t, u))? > 1/cos(u) = 1/k(t,u) —k(t,u) cos(u) < 0 and from (13) we 
have 
1 


BC. = S| eo 


—k(t,u) cos(u) a 5(- + k(t, u) cos(u)). (14) 


1 
K(t,u) 
Therefore, from (13a) and (14), we have 


|A(t, w)| + |B) 


—_ 2 
AGw|-|Bew) (k(t, u))* cos(u). (15) 


Delt, u) = 
Now we distinguish the following two subcases. 
SUBCASE la: k(t, u) => 1/cos(u). We have m(t, wu) = k(t, u) cos(u) > 1, and thus 
(tu) = —— = (k(t)? = 
u) > —— u ——.. 
7 aa ) cos(u) 


Combining the last relation with (15), 


De (t,u) = (k(t, u))? cos(u) = = Dy, (t, u) 


cos(u) 


and Lemma 2.3 implies that 


Dy (t,u) = < Dg (t,u) = (k(t, u))?. 


=o 


SUBCASE Ip: 1//cos(u) < k(t, u) < 1/cos(u). We have ,/cos(u) < m(t,u) <1, 


and therefore the equality 


Dy (tu) = (k@.w)? cos(u) < Daly = Sp 


implies 


Dy (t,u) = (k(t, u))* = Dy)(t,u) = cos2(u) 
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CASE II: k(t, u) < 1/./cos(u). This is equivalent to 


m(t,u) = k(t,u) cos(u) < /cos(u). 


First, we remark that 
k(t,u) < 1/JVcos(u) = > 1/k(t,u) — k(t, u) cos(u) => 0 


| 1 
Rea —k(t,u) cos(u) = =( —k(t,u) cos(u)). 


1 
BS= 
=e 5| k(t, u) 


Therefore, 


JAC) + |B(.w)] 1 


Pre) = TAG yl 1BG] ~ Keay Feostay 


(16) 
We distinguish the following two subcases. 
SUBCASE Ila: k(t, u) < 1. We have m(t,u) < cos(u). So, from (16), the equality 
Dyo(t,u) eS 
5 Uu — nine! = 
fo cos(u) f (k(t, u))? cos(w) 
implies 
1 


1 
b.¢.0) == 2 Ds S 
folt, u) cos?(u) — f (k(t, u))? cos? u 


SUBCASE I]p: | < k(t,u) < 1//cos(u). We have m(t, u) < ./cos(u). Thus the 
equality 


_ 1 _ 
Did =—— 2 3 a) = — 
fou) k2 cos(u) — folt, ¥) cos(u) 
implies 
1 
PI ereeaay cara 


Remark 3.2. For a geographical projection f € 2 we may compute the distortion 
D f(t,u) at a point (¢,u) € So in a more direct way, without using formula (3). 
Indeed, we have seen that the differential dfj;.,) sends the tangent vectors vy; = 
(1,0) and v2 = (0,1) of Ty) E% to the perpendicular vectors w; = (sin(¢)/k, 
—cos(#)/k) and w2 = (k cos(u) cos(@), k cos(u) sin(@)) of Ligggl respectively. 
On the other hand, we have ||w1|| = 1/k and ||w2|| = & cos(u). Therefore, consid- 
ering the cases Ia, Ip, Ia, and II, above, we may find which of the the values || w,|| 
and ||w2|| are greater and calculate the term Ds (t, u). 
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Now, we will show that there exist geographical projections f from So to E? 
which satisfy my(t,u) < cos(u) for each (t,u) € (—x,2) x (- oe Zz). In fact 
in [2] (see p. 36ff of the corresponding translation), Euler resolves the system of equa- 
tions (11) finding particular solutions as follows. 


Setting v = sin(u) in (11), we get 


sin(?) 
k 


cos(p) 


x dt. 


dx =kcos(¢)dv + dt, dy=ksin(¢)dv — 


Assuming that k is a function only of v which is equal to V and the angle ¢ is a 
function only of ¢ and equal to T, 


in(T T 
dx = Vcos(T)dv + =a ) at, dy = V sin(T)dv — ma dat. (17) 
From equations (17) we obtain 
1 : 

x= cos(T) | Vdv = = | sincryat (18a) 

and 
1 

y= sin() | Va = = | coscrydr. (18b) 
Therefore, 

v | vav = : | siocryat = 

v= STU sin =a 

and 


1 
-v Vdv = Say | costa = —4d. 


where a is a constant. Differentiating the relation { Vdv = a/V with respect to the 
variable v, we have 


adV adV 
Vdv =— V2 = 40S— a 


and integrating we have 


a a 
_- — V = ./ ——, 19 
rr e= oy? > "> Vi@4+0 a 


where c is a constant and a/(v + c) > 0. 
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Similarly, for the function 7, we have 


[ siocryat =acos(T) and [costryat = —asin(T). 


Differentiating one of these relations we have 


dt t 
dT =-— => T=--. 
a a 


Finally, for (19) by a simple integration, we obtain 


[ Vav = V2), 


hence 


x= V2a(v + c) cos (<) = /2a(sin(u) + c) cos (<). 


(20) 
y =—y2a(v +c) sin (-) = /2a(sin(uv) + c) sin (-). 
From (20), it follows that 
y t 
— = —tan|—-}, 
x @ (21) 


V¥x2 + y2 = V2a(u +c). 


From the first equation in (21), it is clear that all meridians are represented by straight 
lines emerging as radii from a single fixed point. The second formula implies that 
all parallels are portrayed as concentric circles. These projections are referred to as 
azimuthal geographical projections. 

Finally from (20), we have 


a ca cos? (ut) 


; 7 oy Ee 
m(t,u) = (= du) ——-2a(sin(w) +c) 


and it is not difficult to see that there are constants a and c such that m(t, uw) < cos(u) 
for each (t,u) € So. 
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1 Geometry of polygons 
1.1 Heron’s theorem 


Heron of Alexandria (c. 60 BC) is credited with the following formula that relates 
the area S of a triangle to its side lengths a, b, c: 


S? = (s—a)(s —b)(s —c)s, (1) 


where s = $(a + b +c) is the semiperimeter. 

In the present section of our work we deal with the hyperbolic plane instead of 
the Euclidean one. The hyperbolic plane under consideration is equipped with a 
Riemannian metric of constant curvature k = —1. All the necessary definitions from 
hyperbolic geometry can be found in the book [8]. 

By definition, a cyclic polygon in the hyperbolic plane is a convex polygon in- 
scribed in a curve of constant geodesic curvature (i.e. circle, horocycle or one branch 
of an equidistant line). Useful information about cyclic polygons can be found in [62] 
and [63]. In particular, it is shown in [62] that any cyclic polygon in the hyper- 
bolic plane is uniquely determined (up to isometry) by the ordered sequence of its 
side lengths. In addition, among all hyperbolic polygons with fixed positive side 
lengths there exist polygons of maximal area. Every such a maximal polygon is 
cyclic (see [63]). 
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The following four non-Euclidean versions of Heron’s formula in the hyperbolic 
plane are known since a long time. 


Theorem 1.1 (Heron’s formula in H?). Let T be a hyperbolic triangle with side 
lengths a, b,c. Then its area S = S(T) is given by each of the following formulas: 


(i) sine of ; area formula, 
9 ( 1 s) sinh(s — a) sinh(s — b) sinh(s — c) sinh(s) 
sin® (= S ) = ——————— 
2 4 cosh? (4a) cosh” ($b) cosh? (5c) 
(ii) tangent of + area formula, 
tan? (<5) = tanh (50s - a)) tanh (51s a b)) tanh (50s — c)) tanh (<3): 
4 2 2 2 2 
(ili) sine of + area formula, 


al Nx sinh (5(s — a)) sinh (5(s — b)) sinh ($(s — c)) sinh (5) _ 
= G ) 7 cosh (4a) cosh (3d) cosh (4c) 


(iv) Bilinski formula, 


(5 ) cosh(a) + cosh(b) + cosh(c) + 1 
cos = 
4 cosh (4a) cosh (45) cosh (4c) 


The first two formulas are contained in the book [8], p. 66. The third formula can 
be obtained by square root extraction of the product of the first two. The fourth one 
was derived by Stanko Bilinski in [14] (see also [63]). 

It should be noted that the analogous formulas in spherical space were known be- 
fore the hyperbolic ones. For example, the spherical version of (i) is called Cagnoli’s 
Theorem (see [58], Section 100), (ii) is called Lhuilier’s Theorem (see [58], Sec- 
tion 102), formula (iv) is due to Euler (see [25]). The proofs of (iii) and (iv) can be 
found in [58] (Section 103). 


1.2 Ptolemy’s theorem and annex 


We recall a few well-known facts about cyclic quadrilaterals. A convex Euclidean 
quadrilateral with interior angles A,B,C, D is cyclic if and only if A+C = B+D=nz. 
A similar result for a hyperbolic quadrilateral was obtained by V. F. Petrov [46] and 
L. Wimmer [65]. They proved the following result. 
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Theorem 1.2 (Petrov, 2009, [46]; Wimmer, 2011, [65]). A convex hyperbolic quadri- 
lateral with interior angles A, B,C, D is cyclic if and only if A+ C = B+ D. 


Note that the sum of angles of a hyperbolic quadrilateral is less than 27. Hence, 
for any cyclic hyperbolic quadrilateral we have A+ C = B+ D <n. 

It was shown in [62] that a cyclic n-gon is uniquely determined up to isometry 
by the lengths of its sides. Denote the side and diagonal lengths of a quadrilateral 
as indicated in Figure 1. Then, in the Euclidean case, a quadrilateral is cyclic if and 
only if ef = ac+bd. This is Ptolemy’s theorem. A spherical version of Ptolemy’s 
theorem was established by Darboux and Frobenius. It can be found in the book [37] 
(p. 180, Proposition 4). In hyperbolic geometry, Ptolemy’s identity was proposed by 
T. Kubota [34] and J. E. Valentine [60]. 


Figure 1. A cyclic quadrilateral Q with angles A, B,C, D 


Theorem 1.3 (Ptolemy’s identity in H*). A convex hyperbolic quadrilateral with 
side lengths a,b,c,d and diagonal lengths e, f is cyclic if and only if 


oe 2 ke eg wee oe eee e gee GLK es oh 
sinh (5) sinh (5/) = sinh (52) sinh (5<) + sinh (5) sinh (54). 

An important supplement to Ptolemy’s theorem is the following property of a 
cyclic quadrilateral in the Euclidean plane. Its side and diagonal lengths are related 
by the equation 

e ad+be 


f ab+ted @) 


Together with Ptolemy’s theorem this equation allows one to express the diagonal 
lengths of a cyclic quadrilateral in terms of its side lengths. 
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It was noted in [28] that a similar relationship between the sides and diagonals 
of a cyclic quadrilateral is valid also in hyperbolic geometry. To make the statement 
true, one can replace the side length a by the quantity s(a@) = sinh (4a). In particular, 
formula (2) can be rewritten in the following way. 


Theorem 1.4. The side lengths a,b,c,d and diagonal lengths e, f of a cyclic hy- 
perbolic quadrilateral are related by the following equation 


s(e) _ s(a)s(d) + s(b)s(c) 
s(f) — s(a)s(b) + s(c)s(d) 


By making use of Theorems 1.3 and 1.4 we derive the following formulas for the 
diagonal lengths e, f of a cyclic hyperbolic quadrilateral. Then we have 
s(a)s(d) + s(b)s(c) 


2 = 
s(e)= as ea) (s(a)s(c) + s(b)s(d)), (3) 


Sf) = I EZ olais(e) + s(0)s(d)), a) 

Note that formulas (3) and (4) take place also in Euclidean and spherical ge- 

ometries. In these cases, instead of the function s(a) one should take the functions 

s(a) = a and s(a) = sin (4a) respectively. See [23] and [28] for the arguments in 
the spherical case. 

The above theorems will be used in the next section to obtain a few versions of 


Brahmagupta’s formula for a cyclic hyperbolic quadrilateral. 


1.3 Brahmagupta’s formula 


For polygons with more than three sides, the side lengths do not in general deter- 
mine the area, but they do if the polygon is convex and cyclic (inscribed in a circle). 
Brahmagupta, in the seventh century, gave the following formula for a convex cyclic 
quadrilateral with side lengths a,b,c, d: 


S? = (s —a)(s —b)(s —c)(s — a), (5) 


where s = t(a +b+c+d). See [21], p. 56 for an elementary proof. An interesting 
consideration of the problem can be found in Mobius’s paper [41]. Independently, 
D. P. Robbins [50] and V. V. Varfolomev [61] found a way to generalize these for- 
mulas. The main idea of both papers was to determine the squared area S? as a root 
of an algebraic equation whose coefficients are integer polynomials in the squares of 
the side lengths. See also [26], [18], and [51] for more detailed considerations. 
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One can consider Brahmagupta’s formula as a generalization of Heron’s for- 
mula (1). 

In this section we consider the four versions of Brahmagupta’s formula for a cyclic 
hyperbolic quadrilateral given by the second named author [38]. They are general- 
izations of the respective statements (i)—(iv) of Theorem 1.1. 

In particular, the first statement (i) has the following analog. 


Theorem 1.5 (sine of s area formula). Let Q be acyclic hyperbolic quadrilateral 
with side lengths a,b,c, d. Then its area S = S(Q) is given by the formula 


ofl _ Ss. : = : . ? «ied z 7 
sin (55) — ahs a) sinh(s — 5) sinh(s — c) sinh(s — d)(1 — e), 
where i ; i i 
= 2(+ 2(+ 24% aft 
Yt = cosh (54) cosh (5°) cosh (5°) cosh (54). 
_ sinh (3a) sinh (3b) sinh (4c) sinh (4d) 
~~ cosh (4(s — a)) cosh (4(s - b)) cosh (4(s — c)) cosh (4(s - d)) 
and 


s=satbte+d). 


Note that the value ¢ vanishes if d = 0. In this case, we get formula (i) again. 

Proofs of this theorem and subsequent theorems will be given later, at the end of 
this section. 

The second statement for the case of a hyperbolic quadrilateral has the following 
form. 


Theorem 1.6 (tangent of + area formula). Let Q be acyclic hyperbolic quadrilateral 
with side lengths a,b, c,d. Then its area S = S(Q) is given by the formula 


; 2(13) sinh (4(—a + b +c + d)) sinh (4(@—b +c +4)) 
an = a 
4 cosh (4(a + b —c — d)) cosh (4(a—b +c —d)) 
sinh (4(a + b—c + d)) sinh (4(a+b +c —d)) 
cosh (4(a —b —c + d)) cosh(f(a+b+c+d)) 
It follows from Theorem 1.5 that for any a,b,c,d 4 0 we have | — ¢ > O and 


€ > 0. Hence, 0 < e¢ < 1. Taking into account these inequalities as an immediate 
consequence of Theorems 1.5 and 1.6 we obtain the following corollary. 
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Corollary 1.7. For any cyclic hyperbolic quadrilateral the following inequalities 
take place: 


1 1 
- 2 al ses gk = . = . = . _ 
sin (5 s) < m sinh(s — a) sinh(s — b) sinh(s — c) sinh(s — d) 


and 
tan? (55) > tanh (56 = a)) tanh (56 = b)) tanh (56 = 0)) tanh (56 = d)). 


By squaring the product of the formulas in Theorems 1.5 and 1.6 we obtain the 
following result. It can be considered as a direct generalization of statement (iii) in 
Theorem 1.1. 


Theorem 1.8 (sine of + area formula). Let Q be a cyclic hyperbolic quadrilateral 
with side lengths a,b, c,d. Then its area S = S(Q) is given by the formula 


sin? (55) = a sinh (56 — a) sinh (56 — b)) sinh (56 — 0) sinh (56 — d)), 


where 


and 


1 
s=sa@tbt+cr+d). 


The analogous formula in spherical space can be obtained by replacing sinh and 
cosh by sin and cos correspondingly (see [37], p. 182, Proposition 5). 

Consider a circumscribed quadrilateral Q with side lengths a, b,c, d (Figure 2). 
In this case we havea+c = b+d,hences—a =c, s—b=d,s—c =a, s—d =b. 
As a result we obtain the following assertion. 


Corollary 1.9 (Brahmagupta’s formula for bicentric quadrilateral). Let OQ be a bi- 
centric (i.e. inscribed and circumscribed) hyperbolic quadrilateral with side lengths 
a,b,c,d. Then its area S = S(Q) is given by the formula 


1 1 1 1 1 
sin? (<5) = tanh (52) tanh (5) tanh (5<) tanh (54). 
The analogous formula in spherical space can be given by replacing tanh with tan 
(see [37], p. 46). A Euclidean version of this result is well known (see, e.g., [31]). 


In that case 


S* =abcd. 
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Figure 2. A circumscribed cyclic quadrilateral Q with side lengths a, b,c, d 


The next theorem [38] presents a version of the Bilinski formula for a cyclic 
quadrilateral. 


Theorem 1.10 (Bilinski formula). Let Q be a cyclic hyperbolic quadrilateral with 
side lengths a,b,c, d. Then its area S = S(Q) is given by the formula 


1 1 
cos (<5) = sy (cosha + coshb + coshc + coshd 
— 4sinh (<<) sinh (+5) sinh (<<) sinh (<)). 
2 2 2 2 
Now we outline the proof of Theorems 1.5, 1.6, 1.8, and 1.10. 


Proof. Consider a cyclic hyperbolic quadrilateral Q with side lengths a,b, c,d and 
interior angles A, B, C, D shown in Figure 1. By the Gauss—Bonnet formula we get 
the area 


S = S(O) =2xn-A-B-C-—D. 


To prove Theorem 1.5 let us find the quantities sin? (4s ) and cos? (4s ) in terms 


of a,b,c,d. Since A+ C = B + D (see Proposition 1.2) we have 
1 1 
2 sin? (75) = 1—cos (55) = 1-—cos(am# —(A+C)) = 1+ cos(A+C). 
Hence, 


sin? (<5) a 5(l + cos(A) cos(C) — sin(A) sin(C)). (6) 
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Now we show that cos(A), cos(C’), and the product sin(A) sin(C) can be ex- 
pressed in terms of elementary functions in a,b,c,d. To find cos(A) we use the 
Cosine rule for the hyperbolic triangle ABD. 


east) a) con?) — cosh(f) (7) 
sinh(a) sinh(d) 
We note that cosh(f) = 2s?(f) + 1, cosh(a) = 2s7(a) + 1, and cosh(d) = 
2s*(d) + 1. Putting these identities into equations (4) and (7) we express cos(A) in 
terms of a,b,c, d. After a straightforward calculation we obtain 


cos(A) = Ewer — s?(b) — s?(c) + s?(d) 


(8) 
+ 2s(a)s(b)s(c)s(d) + 2s?(a)s?(d)), 
where 
N= 2(s(a)s(d) + s(b)s(c)) cosh (52) cosh (54). 
In a similar way we get the formula 
cos(C) = = (—s?(a) +5%(b) + 5(c) —s°(d) . 


+ 2s(a)s(b)s(c)s(d) + 25(b)s2(c)), 
where 


MW = 2(s(a)s(d) + s(b)s(c)) cosh (50) cosh (5°) 


Note that sin(A) sin(C) > 0 and sin?(A) sin?(C) = (1 — cos?(A))(1 — cos?(C)). 
Then we take square roots in the latter equation, where cos(A) and cos(C) are given 
by formulas (8) and (9). Thus we express the product sin(A) sin(C) in terms of 
a,b,c, d. Substituting cos(A), cos(C), and sin(A) - sin(C) into (6) and simplifying 
we get 

1 1 1 
~2(io\)_ fe S(- =(¢— 
sin ( Ss) W sinh (5( atb+c+d)) sinh (5(a b+c+d)) i 
1 1 
sinh (5 +b—-c+ d)) sinh (5 +b+ce- d)). 


This proves Theorem 1.8. 


In a similar way, from the identity cos? (4S) = 1+ cos (4S) = 1—cos(A+C) 
we have 
cos” (<5) = = cosh (<a +b-c— d)) cosh (<a —b+ec- d)) 
4 aid 4 4 (11) 
1 1 
cosh (j@ —b-c+ d)) cosh (5 +b+cet+ d)). 
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The following lemma can be easily proved by straightforward calculations. 


Lemma 1.11. The expression 
_ cosh (¢(a + b —c —d)) cosh ($(a—b + c—d)) 
~~ cosh (4(-a +b +c + d)) cosh ($(a—b+c+d)) 
cosh (F(a —b-c+ d)) cosh (F(a +b+c+ d)) 
cosh (4(a +b-—ct+ d)) cosh (4(a +b+c- d)) 
can be rewritten in the form H = | — €, where 
sinh (5a) sinh (4b) sinh (4c) sinh (4d) 
cosh (4(s _ a)) cosh (4(s — b)) cosh (4(s — c)) cosh (4(s - d)) 


and 


1 
s=slatbte+d). 


Taking four times the products of equations (10) and (11) we have 


sin? (55) = =sinh (5-2 +b+ce+ d)) sinh (5(a —b+ct+ d)) 


1 1 o 
sinh (5( +b-—ct+ d)) sinh (5 +b+e —d))H, 


where # is the same as in Lemma 1.11. 

Then the statement of Theorem 1.5 follows from equation (12), Lemma 1.11 and 
the evident identity s—a = 4(-a +b+c-+d).To prove Theorem 1.6 we divide (10) 
by (11). 

Finally, the Bilinski formula (Theorem 1.10) follows from the identity cos (4 S ) = 
cos? ( +S )- sin? (is ) and the above mentioned equations (10) and (11). 


1.4 Bretschneider’s formula 


In 1842, Carl Bretschneider [15] related the area of an arbitrary Euclidean quadrilat- 
eral to its side lengths and the sum of two opposite angles. The area S' of a Euclidean 
quadrilateral with side lengths a, b,c, d and opposite angles A and C is given by the 
formula 


S? = (s —a)(s —b)(s —c)(s — d) abed cos? (5(A +0), 


where s = i(a +b +c +d) is the semiperimeter (for a proof see e.g. [47], p. 89). 
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The statement of the theorem remains valid if one substitutes A + C with the sum of 
another pair of opposite angles B + D. By making use of the identity 


A+B+C+D=2n 


for any Euclidean quadrilateral we can rewrite the Bretschneider’s theorem in the 
following more symmetric way 


S? = (s —a)(s — b)(s —c)(s — d) abcd sin? ({(4- B+ C -D)). 


Precisely, this statement allows a generalization for the case of a non-Euclidean 
quadrilateral. In the particular case of a cyclic quadrilateral (when A+ C = B+ D) 
we get a Brahmagupta’s formula (5). Recall that the sum of angles is not equal to 27 
anymore. 

The following theorem gives a hyperbolic version of Bretschneider’s formula. 


Theorem 1.12 (Baigonakova and Mednykh, 2012, [11]). The area S of a hyper- 
bolic quadrilateral with side lengths a,b,c,d and angles A, B,C, D is given by the 


formula 
sin? (<5) 
= gpsith (300 a) sinh (5109) inh (515 0) sinh (566— 
—tanh (52) tanh (5%) tanh (5°) tanh (54) sin? (G4 -B+C-D), 


where s = $(a +b+c +4) is the semiperimeter. 


Proof. Consider the Poincaré disk model of the hyperbolic plane. This is the unit 
disc U = {z € C:|z| < 1}, provided with the metric ds = Sie. The boundary 
of U consists of the infinitely distant points and called absolute. The geodesics in 
this model are circle arcs and segments orthogonal to the absolute. Let z and w be 
arbitrary points of the disc U, and 0 be the origin. Then the hyperbolic and Euclidean 
distances between these points (p(z, w) and |z — w| correspondingly) are related by 


the formula ([13], p. 22-123) 


sinh (Sp(z, w)) 


cosh (Ep, 2)) cosh (4010.0) (3) 


|z—w| = 
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Consider a hyperbolic quadrilateral ABCD (Figure 3). We suppose that the ver- 
tex A coincides with the origin 0. From (13) we find the sides of the Euclidean 
quadrilateral ABCD with respect to the hyperbolic lengths a,b,c, d,e, f. We have 


AB =tanh (<<) ee = (14a) 
2 cosh (4a) cosh (4e) , 
1 sinh (3c) 
AD =tanh(-d), CD =————2->__.. 14b 
ae (5 ) cosh (4e) cosh (3d) Core 


Figure 3. A hyperbolic quadrilateral ABCD in the Poincaré model of H? 


Let C* be a point symmetric to C with respect to the absolute. To find the 
Euclidean distances BC* and DC* we use the identity C* = 1/C and the fol- 
lowing formula from ([{13], p. 123) 


1 Z—W 
tanh (Set. w)) = | = | : (15) 
We get from (13) and (15) 
h (4b h (4 
aor = ost (24) ma pees (16) 


cosh (3a) sinh (Se) cosh (3d) sinh (4e) , 
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Denote the angle between the arc BC and the chord BC by x, and the angle 
between the arc DC and the chord DC by y. Then the angle measure of BC is 2x, 


while the angle measure of CD is 2y. Recall that the inscribed angle is equal to 
half of the corresponding cyclic arc. Then we obtain that the angles BC*C and 
CC*D are equal to x and y respectively. Denote by A, B, C, D the interior angles 
of the hyperbolic quadrilateral ABCD, and by A, B,C, D the interior angles of the 
Eucledean quadrilateral ABCD. We have 


nw An 


A=A, B=B+x, C=C+x+y, D=D+y. 
Taking into account that A+ B+ C + D = 2m, we have 
A+B4+C+D+2x+2y = 2n. 


Hence, 


1 1 
x+y=-(2n —-A—B-—-C-—D)=-S. 
2 2 
From the Euclidean triangle BC* D we have 


) _ BDF (BC = DCryY 


sin” (5+) = sin? (25 4BC*-DC* 


4 
Now using (14) and (16) we substitute the Euclidean lengths in the latter equation 
with the hyperbolic ones. Then 


sin? (<5) = a (sinh (5°) sinh (51) 
— (cosh (52) cosh (5°) — cosh (50) cosh (54). 


This formula expresses the area of a hyperbolic quadrilateral in terms of its sides and 
diagonals. 

The next step is to find a similar formula for sin? ( 4(A -B+C- D)). First, 
we need the following Bretschneider relation for the Euclidean quadrilateral ABCD 
([47], p. 85) 


(17) 


(AC - BD)* = (AB- CD)? + (BC - AD)? 
—2AB-BC-CD-ADcos(A+C). 
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Since 
At O=AtCtxtyant (A-B+C-D), 
we get 
cos (544- B+c — D)) = —cos(A + C) 


_ (AC - BD)? — (AB. CD)? — (BC - AD)? 
a 2AB-BC-CD-AD , 
or 


1 AB-CD + BC-AD)? —(AC - BD)? 
sin2 (=(4- B+c — D)) — (A860) TE BCsAD) SAC eaD) 
2 4AB-BC-CD-AD 
As before, by (14) and (16) we obtain 


sin? (5(4- B + C ~ D)) 


2 2 


_ (sinh (4a) sinh (4c) — sinh (35) sinh ($d))” — (sinh (3e) sinh (3 f)) 
7 4 sinh (4a) sinh (45) sinh (4c) sinh ($d) , 

(18) 
Eliminating the expression ( sinh (Se) sinh (4 f )) from (17) and (18) and making 
use of standard formulas we finally obtain 


sin? (<5) a - sinh (56 _ a)) sinh (56 - b)) sinh (56 — 0)) sinh (56 — d)) 


— tanh (52) tanh (50) tanh (5¢) tanh (54) 


sin? (j(4- B+ C ~ D)). 


The analogous theorem holds in spherical geometry as well (for more details see 
the recent work [10] or the old papers [12] and [29]). 


Theorem 1.13 (Baigonakova and Mednykh, 2012, [11]). The area S of a spheri- 
cal quadrilateral with side lengths a,b,c,d and angles A, B,C, D is given by the 
formula 


-S 
4 cos (3a) cos (45) cos (4c) cos (3d) 


— tan (52) tan (5) tan (5°) tan (54) sin? (G4 —-B+C- D)), 


where s = $(a +b+c+4d) is the semiperimeter. 
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1.5 Casey’s theorem 
John Casey generalized Ptolemy’s theorem in the following way (see [16], p. 103). 


Theorem 1.14 (Casey’s theorem). Let four circles O,, Oz, 03, O4 on the Euclidean 
plane I? be internally tangent to a circle O in the prescribed order. Denote by tj j 
the length of the common external tangent of the circles O;, O;. Then 


ti2 34 + to3 tig = 113 f24. 


We say that a common tangent of two circles is external if it does not intersect 
the segment joining the centers of the circles. 
The following hyperbolic version of Casey’s theorem is given in [7]. 


Theorem 1.15 (Abrosimov and Mikaiylova, 2015, [7]). Let four circles O,, O2, 
O3, O4 on the hyperbolic plane WH? be internally tangent to a circle O in the pre- 
scribed order. Denote by tj the length of the common external tangent of the circles 
O;, O;. Then 


1 1 1 1 1 1 
sinh (5112) sinh (5134) + sinh (50) sinh (5114) = sinh (51132) sinh (512), 


Proof. Denote by O; the centers of the circles respectively (Figure 4). Let R; be the 
radii of the circles O; and K; the points of tangency with the circle O. 


K2 


Figure 4 
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Consider the quadrilateral with sides O;O;, Ri, Rj, and tj; (Figure 5). Denote 
by d the diagonal which divides the quadrilateral into triangles 1 and 2. Since the radii 
R; and R; are orthogonal to the common tangent t;;, we can apply the hyperbolic 
Pythagorean theorem for triangle 1 (see, e.g., [8]) 


cosh(d) = cosh(R;) cosh(¢;;). (19) 


Denote the angle between the diagonal d and the tangent ¢;; by 5. Then by the 
Sine rule for the hyperbolic triangle 1 we have 


sinh(R;) 


sin 6 = “sinh(d) © 


Figure 5 


We use the second Cosine rule for the hyperbolic triangle 2 (see, e.g., [8]) 
cosh(O; 0;) = cosh(R;) cosh(d) — sinh(R;) sinh(R;) cos (> = 8), (20) 
Substituting cosh(d) from (20) into (19) we obtain 
cosh(O; O;) = cosh(t;;) cosh(R;) cosh(R;) — sinh(R;) sinh(R;). (21) 


We apply the first Cosine rule to the triangle OK; K; where OK; = OK; = R 
(see Figure 6) 
cosh?(R) — cosh(K; K;) 
sinh?(R) , 
Using the first Cosine rule for the hyperbolic triangle OO;O;, where OO; = 
R— Ri, OO; = R— R;, we have 


cos(ZK;OK;) = (22) 


cosh(OQ; O;) = cosh(R — R;) cosh(R — R;) 
— sinh(R — R;) sinh(R — R;) cos(Z.K; OK;). 
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= 


Figure 6 


Kj 


Combining the latter expression with (21) and (22) we obtain 


sinh?(R) cosh(R;) cosh(R;) h2 (3 


= FaaHRP UR aah ROR =ti; ). 23 
2 sinh(R — R;) sinh(R — R;) sin ii) (23) 


1 
sinh? (5Ki K;) 
2 


2 


The points of tangency of the circles O01, Oz, 03, O4 with the circle O form a 
cyclic quadrilateral K, Ky K3K4 in H?. According to the hyperbolic Ptolemy theo- 
rem (Theorem 1.3), the segments K; K; satisfy the equation 


i 1 
inhi (5KiKs) inhi (5 KoKs) 
1 1 1 1 a 
= sinh (5 Kika) sinh (5KaKs) + sinh (5 K2Ks) sinh (5KiKs). 


We take the square root of (23) and substitute the expressions for sinh ( 3K; K i) 
in (24). Then we divide the resulting equation by 


nh? R cosh(R,) cosh(R2) cosh(R3) cosh( R4) 
sin So 
2 sinh(R — R,) sinh(R — R2) sinh(R — R3) sinh(R — R4) 
to obtain 


sin. sin. Siu sin + sin sin . 


The spherical analog of Casey’s theorem holds as well [7]. 


Theorem 1.16 (Abrosimov and Mikaiylova, 2015, [7]). Let four circles Oi, O2, 
O03, O4 on the sphere S? be internally tangent to a circle O in the prescribed or- 
der. Denote by tj; the length of the common external tangent of the circles O;, Oj. 
Then 


sin | = sin | = sin { = sin { = = sin {[— sin | = 
2 12 7 34 7 23 2 14 7 13 2 24 
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One can prove the spherical version of the theorem using the same arguments as 
in the hyperbolic case. The only difference is to replace hyperbolic trigonometric 
relations by corresponding spherical ones. 


2 Geometry of polyhedra 


Calculating volumes of polyhedra is a classical problem, that has been well known 
since Euclid. It remains relevant nowadays, partly due to the fact that the volume 
of a fundamental polyhedron is one of the main invariants for a three-dimensional 
manifold. 


2.1 Volume of hyperbolic orthoscheme 


Volume formulas for non-Euclidean tetrahedra in some special cases have been known 
since Lobachevsky, Bolyai and Schlafli. For example, Schlafli [53] found the volume 
of an orthoscheme in the 3-dimensional sphere $7. Recall that an orthoscheme is an n- 
dimensional simplex defined by a sequence of edges (vo, v1), (V1, V2), .--, (Un—1, Un) 
that are mutually orthogonal. In three dimensions, an orthoscheme is also called a 
birectangular tetrahedron. By definition, a 3-dimensional orthoscheme has three di- 
hedral angles > and three others, which we will refer as essential ones (see Figure 7). 


U1 


U3 


vo 
v2 


Figure 7. An orthoscheme T = T(A, B, C) with essential dihedral angles A, B,C, all other angles 
It 

are > 
2 
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Theorem 2.1 (Schlafli, 1858, [52]). Let T be a spherical orthoscheme with essential 
dihedral angles A, B,C. Then its volume V = V(T) is given by the formula 


1 
V = 7S(A, B,C), 


where 
s(F-nn 5-2) 
= S(x, y,z) 
= 3 (- — sin(x) sin(z) )” cos(2mx) — cos(2my) + cos(2mz) — 1 
Saye sin(x) sin(z) m2 
— x2 4 y?— 22, 


and D = \/cos?(x) cos2(z) — cos2(y). 


The function S(x, y, z) appearing in Theorem 2.1 is called the Schldfli function. 

The volume of a hyperbolic orthoscheme was obtained independently by Nikolai 
Lobachevsky [35] and Janés Bolyai (see [57], [56], or [64]). The following theorem 
was suggested by G. S. M. Coxeter [20]. It represents a result of Lobachevsky in a 
very simple form. 


Theorem 2.2 (Lobachevsky, 1835, [35]; Coxeter, 1835, [20]). Let T be a hyperbolic 
orthoscheme with dihedral angles A, B,C. Then its volume V = V(T) is given by 
the formula V = ES(A, B,C), where S(A, B,C) is the Schlafli function. 


J. Bolyai found the volume of a hyperbolic orthoscheme in terms of planar angles 
and an edge length. This result was not published during his lifetime. It can be 
found in the work [57], pp. 111-118 collected on the base of original manuscripts in 
reliquias by Janés Bolyai (see also [56] and [64]). 


Theorem 2.3 (Bolyai, 1832). Consider a hyperbolic orthoscheme T with essen- 
tial dihedral angles along AC, CB, BD, and all other dihedral angles equal to > 
(Figure 8). T is given by its planar angles a, B, y and edge length z. Then its volume 
V = V(T) can be found by the formula 


Zz 


Pe tan(y) u sinh(u)du 
a 2 tan(B) cosh? (u) _ / cosh? (u) -_ , 
0 ( cos? (a) 1) cos2(y) | 
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D 


CD 1 CBA 
AB 1 BCD B 


Figure 8. An orthoscheme 7 given by planar angles a, 8, y and edge length z 


2.2 Volume of ideal tetrahedra and octahedra 


An ideal tetrahedron is a hyperbolic tetrahedron with all vertices at infinity. Opposite 
dihedral angles of an ideal tetrahedron are pairwise equal and the sum of dihedral 
angles at the edges adjacent to one vertex is A+ B+ C = 7z (see Figure 9). 


Figure 9. An ideal tetrahedron T = T(A, B, C) and an ideal octahedron O = O(A, B,C, D, E, F) 


The volume of an ideal tetrahedron has been known since Lobachevsky [35]. 
J. Milnor presented it in very elegant form [39]. 
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Theorem 2.4 (Lobachevsky, 1835, [35]; Milnor, 1982, [39]). Let T be an ideal 
tetrahedron with dihedral angles A, B and C. Then its volume V = V(T) is given 
by the formula 

V = A(A)+ A(B) + ACC), 


where 
x 


A(x) = = [08 |2 sin t|dt 
0 
is the Lobachevsky function. 


Corollary 2.5. The maximal volume of an ideal tetrahedron is attained in the case 
when the dihedral angles A= B=C = a0. It is equal to 


Vax = 3A( =) = 1.01494... 


A more general case of a tetrahedron with at least one vertex at infinity was in- 
vestigated by E. B. Vinberg [8]. 

An ideal octahedron is a hyperbolic octahedron with all vertices at infinity. Con- 
sider a symmetric ideal octahedron whose opposite dihedral angles are pairwise equal 
(see Figure 9). By definition, the sum of the dihedral angles at the edges adjacent to 
any vertex is equal to 277. We have 


A+B+C+D=2n, A+C+E+F=2n7, B+D+E+4+F =2r, 
or 
C=n-A, D=xnx-B, F=n2-E. 
Hence, A, B,C, D, E, F € (0,2). The volume formula for such an octahedron is 
very similar to Milnor’s formula for ideal tetrahedra and is given in [9]. 


Theorem 2.6 (Baigonakova and Mednykh, 2010, [9]). Let O be a symmetric ideal 
octahedron with dihedral angles A, B,C, D, E, F. Then its volume V = V(O) is 
given by the formula 
Vol 
oO) _ 
2 


1 1 
(5+ 4+ B+) +A(5@-A-B+B) 
1 1 
+ A(5(0 + 4A-B-E)) + A(5@—A+B-B)), 
where A(x) is the Lobachevsky function. 


Corollary 2.7. The maximal volume of an ideal octahedron is attained in the case 
when A= B=C=D=E= F = &., It is equal to 


Alan 


Vinx = 8A(=) = 3.66386... 
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2.3 Seidel’s conjecture 


In 1986, J. J. Seidel [54] conjectured that the volume of an ideal hyperbolic tetrahe- 
dron can be expressed as a function of the determinant and the permanent of its Gram 
matrix. Recall that the formula expressing the volume of such a tetrahedron in terms 
of dihedral angles has been known since Lobachevsky and Bolyai (Theorem 2.4). In 
spite of this, Seidel’s problem had not been solved for a long time. Ten years later, 
a strengthened version of Seidel’s conjecture was suggested by Igor Rivin and Feng 
Luo. They supposed that the volume of a non-Euclidean tetrahedron (hyperbolic or 
spherical one) depends only on the determinant of its Gram matrix. It was shown 
in [1] and [2] that the strengthened conjecture is false, while Seidel’s conjecture is 
true within certain conditions. 

Consider a non-Euclidean tetrahedron T in the sphere $? or in the hyperbolic 
space H?. Denote its vertices by numbers 1,2,3,4. Let Aj; denote the dihedral 
angle at the edge joining vertices i and 7. For convenience, we set Aj; = x for 
i = 1,2,3,4. It is well known [8] that, in the hyperbolic and spherical spaces, 
the tetrahedron T is uniquely (up to isometry) determined by its Gram matrix G = 
(— cos(Ai;))i,j=1,...,4- 

Recall that the permanent of a matrix M = (mjj;)ij=1,2....,n is defined by 


n 
per(M) = So mij per(Mj;), per(mi;) = mij. 
i=1 


where M;; is the matrix obtained from M by removing the i-th row and j-th column. 
Conditions for the existence of spherical and hyperbolic tetrahedra in terms of Gram 
matrices are given in [36] and [59], respectively. 

The solution of Seidel’s problem, which was posed in [54], is given by the fol- 
lowing theorem. 


Theorem 2.8 (Abrosimov, 2010, [2]). In each of the classes of acute or obtuse! ideal 
tetrahedra the volume is determined uniquely by the determinant and permanent of 
the Gram matrix. 


Proof. As is known (see, e.g., [39]), opposite dihedral angles of an ideal tetrahedron 
are pairwise equal and the sum of the dihedral angles at the edges adjacent to one 
vertex is A+ B+ C =z (Figure 9). 


: . sd 
! By an obtuse tetrahedron we mean a tetrahedron with at least one dihedral angle > 5 
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Thus, we can take C = 2 — A — B. The Gram matrix has the form 


1 —cos(A) —cos(B)  cos(A + B) 
G —cos(A) 1 cos(A+ B) —cos(B) 
—cos(B)  cos(A + B) 1 —cos(A) 
cos(A+ B) —cos(B) —cos(A) 1 
We have 


det(G) = —4sin?(A) sin?(B) sin?(A + B), 


per(G) = 4+ 4cos?(A) cos?(B) cos?(A + B). 


To prove Theorem 2.8, we show that the dihedral angles of an ideal tetrahedron 
are uniquely (up to a permutation) determined by det G and per G in each of the 
cases: acute angled tetrahedron and obtuse angled tetrahedron. 

Without loss of generality, we can assume thatO < A< B<C=a2-A-B. 
Then the dihedral angles A, B are a priori acute, and the angle C is either acute or 
obtuse. In the former case, the tetrahedron under consideration is acute-angled, and 
in the latter case, it is obtuse-angled. 

Let us introduce the new variables 


x = sin(A)sin(B), y =cos(A)cos(B) 


and show that, for a fixed left-hand side, the solutions of the system of equations 
1 2 2 
—7 det(G) = x2(1— (y—x)), 


i 
qPer(G) = y(y—x)? +1 


correspond to one tetrahedron (up to isometry) in each of the two cases mentioned 
above. 


Suppose that the system has a pair of different solutions (a, b) and (x, y). Then 
we have 


riba) jax =o =x), 


b?(b—a)? = y*(y — x)’. 
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For the angle C to be acute means that 
cos(A) cos(B) — sin(A) sin(B) = —cos(C) < 0, 


i.e., both solutions satisfy the inequalities b(b — a) < 0 and x(x — y) < 0. If the 
angle C is obtuse, then the reverse inequality holds. This allows us to take a square 
root in the second equation without loosing solutions: 


a*(1—(b—a)*) = x?(1—(y — x)), 
b(b—a)=y(y—X). 


Expressing x from the second equation and substituting the resulting expression 


into the first one, we obtain a sixth-degree polynomial equation in y. Fortunately, it 
decomposes into the linear factors and the biquadratic polynomial 


(b—y)- (b+ y)-(y* —(@ +44 + 2ab — 2a*b — b* + 2ab? — b*)y? 
+ a*b? — 4a°b? + 6a7b* — 4ab° + b°) =0. 
Thus, all solutions can be found in radicals. Substituting the expressions for x and 


y in terms of dihedral angles, we see that different solutions of the system correspond 
to the same ideal tetrahedron 7(A, B, C) up to reordering the dihedral angles. 


Note that, in Theorem 2.8, the assumption that the tetrahedron is either acute- 
angled or obtuse-angled cannot be dispensed with. This is demonstrated by the fol- 
lowing example. 


Example 2.9. Consider a pair of ideal tetrahedra, the obtuse-angled tetrahedron 
T,(s, 5, 7 — 2s) and the acute-angled tetrahedron T> (¢, 3 (x —ft), 3 (a _ t)), where 


y2+ 4+ sea 


2/2 
(4 


S = arccos ( 


—26 + 10V *) 
t = arccos 3 ; 


The determinants and the permanents of the Gram matrices of these tetrahedra 
coincide. We have 


det(G(T\)) = det(G(T2)) = 107—S1V17 


128 
163 + 85/17 
per(G(7\)) = per(G(T2)) = — 


At the same time, the volumes of the tetrahedra T; and T> are different and equal 
to 0.847365 and 1.01483 respectively. 
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2.4 Volume of a general hyperbolic tetrahedron 


Despite the fact that partial results on the volume of non-Euclidean tetrahedra were 
known, a volume formula for hyperbolic tetrahedra of general type remained un- 
known until recently. A general algorithm for obtaining such a formula was indicated 
by W.-Y. Hsiang in [30]. A complete solution was obtained more than ten years later. 
A general formula was proposed in paper by Y. Cho and H. Kim [17]. However, it 
was asymmetric with respect to permutation of angles. J. Murakami and Y. Yano [44] 
proposed a formula expressing the volume by dihedral angles in a symmetric way. 
A year later, A. Ushijima [59] presented a simple proof of the Murakami—Yano for- 
mula. He also investigated the case of a truncated hyperbolic tetrahedron. 


It should be noted that in all these studies the volume is expressed as a linear 
combination of 16 dilogarithms or Lobachevsky functions. The arguments of these 
functions depend on the dihedral angles of the tetrahedron and some additional pa- 
rameter, which is the root of some quadratic equation with complex coefficients in a 
sophisticated form. 


The geometric meaning of that formula was explained by P. Doyle and G. Lei- 
bon [24] in terms of Regge symmetry. A clear description of these ideas and a com- 
plete geometric proof of the formula was given by Y. Mohanty [42]. In particular, she 
proved the equivalence of Regge symmetry and homogeneity (scissors congruence) 
in hyperbolic space [43]. 

In 2005, D. A. Derevnin and A. D. Mednykh [22] proposed the following integral 
formula for the volume of a hyperbolic tetrahedron. 


Theorem 2.10 (Derevnin and Mednykh, 2005, [22]). Let T(A, B,C, D, E, F) be 
a compact hyperbolic tetrahedron with dihedral angles A, B,C, D, E, F. Then its 
volume V = V(T) is given by the formula 


22 
1 lo cos ($(A + B+ C +2z))cos($(A+ E+ F +2)) 
sin(3(A+ B+D+E+z))sin(3(A+C+ D+F +z)) 


ZI 
cos ($(B + D+ F +z))cos(3(C +D+E+z)) F 
sin ((B +C+E4+F + z)) sin (32) 


Z, 


where Z, and Zz are the roots of the integrand satisfied the condition 0 < z2—z, < X. 
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More precisely, 
ks ky k3 ky 
Z, = arctan ey, — arctan G): Z2 = m — arctan i — arctan (.): 
where 


k, = —cos(S) —cos(A + D)—cos(B + FE) —cos(C + F) —cos(D+ E + F) 
—cos(D + B+ C)-—cos(A+ E4+C)-cos(A+ B+ F), 


kz = sin(S) + sin(A + D)+ sin(B + FE) + sin(C + F)+sin(D+ E+ F) 
+sin(D+ B+C)+sn(A+£+C)+sin(A+ B+ F), 


k3 = 2(sin Asin D + sin Bsin E + sinC sin F), 


ka= fk? +2 —K2, 


S=A+B+C+D+E+F. 


2.5 Sforza’s formula for non-Euclidean tetrahedra 


Surprisingly, but more than 100 years ago, in 1907, Giuseppe Sforza found a fairly 
simple formula for the volume of a non-Euclidean tetrahedron. This fact has became 
widely known after a discussion between the second named author and J. M. Mon- 
tesinos at the conference in El Burgo de Osma (Spain), August, 2006. Unfortunately, 
the outstanding work of Sforza [55] published in Italian has been completely forgot- 
ten. 

The original arguments by Sforza are based on some identity given by H. W. Rich- 
mond [49] besides the Schlafli formula. Then he used the Pascal equation [45] for 
minors of the Gram matrix and some routine calculations. In this subsection we 
present another simpler proof of Sforza’s formula. The idea of the proof belongs to 
the authors [6]. 

Consider a hyperbolic (or a spherical) tetrahedron T with dihedral angles A, B, C, 
D, E, F. Assume that A, B, C are dihedral angles at the edges adjacent to one vertex 
and D, E, F are opposite to them correspondingly (see Figure 10). Then the Gram 
matrix G(T) is defined as follows: 


1 —cos(A) —cos(B) —cos(F) 
G=l7 cos(A) 1 —cos(C) —cos(E£) 
~ | —cos(B) —cos(C) 1 —cos(D) 


—cos(F) —cos(E) —cos(D) 1 
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Figure 10. A tetrahedron T = T(A, B, C, D, E, F) in H? or $3 


Denote by © = (cj;);,;=1,2,3,4 a matrix of cofactors cj; = (-1)'7 G;;, where 
Gj; is (i, j-th minor of G. 

In the following proposition we collect some known results about hyperbolic 
tetrahedra (see, e.g., [59]). 


Proposition 2.11. Let T be a compact hyperbolic tetrahedron. Then 
(i) det(G) < 0; 

(ii) cij > 0, i € {1,2, 3, 4}; 

(ili) 27 > Ohi 3 1,7 = 11, 2,334) 

(iv) cosh(¢;;) = Torr 

where €;; is the hyperbolic length of the edge joining vertices i and j. 


Further, we need the following assertion due to Jacobi ([48], Theorem 2.5.1, 
p. 12). 


Proposition 2.12 (Jacobi theorem). Let M = (mi;);j,j=1,....n be a matrix and A = 
det(M) be the determinant of M. Denote by © = (ci;)i,j=1,...,n the matrix of cofac- 
tors Cij = (—1)'*/ det M;;, where M;; is the (n— 1) x (n—1) minor obtained by re- 
moving the i-th row and j -th column of the matrix M. Then for anyk, 1 <k <n-1 
we have 

det(cij)i,;=1,..,4 = A”? det(miz)i,j=k41,....n- 
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One of the main tools for volume calculations in H? and $3 is the classical Schlifli 
formula. 


Proposition 2.13 (Schlafli’s formula). Let X” be a space of constant curvature K. 
Consider a family of convex polyhedra P in X" depending on one or more param- 
eters in a differential manner and keeping the same combinatorial type. Then the 
differential of the volume V = V(P) satisfies the equation 


(n—1)KdV = )0 Vn-2(F) d0(F), 
F 


where the sum is taken over all (n — 2)-facets of P, Vyz—2(F) is (n — 2)-dimensional 
volume of F, and 0(F) is the interior angle along F. 


In the classical paper by Schlafli [52] this formula was proved for the case of a 
spherical n-simplex. For the hyperbolic case, it was obtained by H. Kneser [33] (for 
more details, see also [8] and [40]). 

Now we are able to prove the following theorem (see also [6]). 


Theorem 2.14 (Sforza’s formula in H?). Let T be a compact hyperbolic tetrahedron 
with Gram matrix G. Consider G = G(A) as a function of the dihedral angle A. 
Then the volume V = V(T) is given by the formula 


_ ft c34(A) — ./—det G(A) sin(A) 
ji i/ lg (a + ,/—det G(A) aa) os 


where Ag is a suitable root of the equation det G(A) = 0 and c34 = c34(A) is the 
(3, 4)-cofactor of the matrix G(A). 


Proof. Let A = det(G). By the Jacobi theorem (Proposition 2.12) applied to the 
Gram matrix G form = 4 and k = 2 we obtain 


C33C44 — Cy = AQ = cos?(A)). 


C34 


JC33C44 * 


—A A 
sinh(€4) = \/cosh?(£4) — 1 = econ = Bain v—Asin( yee) 
€33C44 


/C33C44 


By the Cosine rule (Proposition 2.11, (iv)) we get cosh(£4) = Hence, 
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Since exp(+£4) = cosh(€4) + sinh(£4) we have 


c34 £ V—A sin(A) 
exp(+£,4) = “jaa 
Hence, 
enOt) = exp(€4) _ 634 + /—Asin(A) 
exp(—£4) —¢34 — V—Assin(A) 
and 
= i lo (= +/—-A nt) 
as c34 — /—Asin(A)/ 


By the Schlafli formula (Proposition 2.13) for V = V(T) we have 


1 
—-dV = 720 da, a€{A,B,C,D,E, F}. 


By the assumption that the angle A is variable and all the other angles are constant, 
we get 


1 
-dV = 5 faa. 


Note that det(G) —> 0 as A — Ao. Thus, V > 0 as A — Apo. Then integrating both 
sides of the equation we obtain 


A A 
£ 1 —~/-d in(A 
v= [(-B)aa=5 [ tog(2 2 yee ai, 
4 c34 + V—det G sin(A) 


Ao Ao 


where the lower limit Ao is a suitable root of the equation det G(A) = 0. 


In the following proposition we collect some known results about spherical tetra- 
hedra (see, e.g., [36]). 


Proposition 2.15. Let T be a spherical tetrahedron. Then 
(i) det(G) > 0; 

(ii) Cii > 0,7 =1,...,4 
a cij 

(iii) cos(ij) = Wor 


where £;; is the length of the edge joining vertices i and j in ca 
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The next theorem presents a spherical version of Sforza’s formula. 
Theorem 2.16 (Sforza’s formula in $+). Let T be a spherical tetrahedron with Gram 


matrix G. Consider G = G(A) as a function of the dihedral angle A. Then the 
volume V = V(T) is given by the formula 


c34(A) —i./det G(A) sin(A) 


where Ao is a suitable root of the equation det(G(A)) = 0 and c34 = €34(A) is the 
(3, 4)-cofactor of the matrix G(A). 


A * 
Ve a [ove (2X + i/fdet G(A) =) aA, 
4i 
Ao 


The proof is similar to one given in the hyperbolic case. 


2.6 Volumes of hyperbolic octahedra with symmetries 


In this subsection, we consider octahedra in H? and $?. It is known that a regular 
ideal octahedron is a fundamental polyhedron for the complement of the Whitehead 
link embedded into the three-dimensional sphere. Moreover, a hyperbolic manifold 
of minimum volume (the Matveev-Fomenko—Weeks manifold) and many other man- 
ifolds can be obtained by Dehn surgeries on the Whitehead link. Thus, hyperbolic 
octahedra can serve as fundamental polyhedra for a wide class of manifolds, includ- 
ing ones of small volume. The latter seems to be of particular interest when one deals 
with hyperbolic manifolds ordered by increasing volume. 

The problem of finding a volume formula for an arbitrary non-Euclidean polyhe- 
dron is very hard. It was only solved for a general hyperbolic or spherical tetrahedron. 
Even for an octahedron the problem is still open. But in the case of a polyhedron with 
nontrivial symmetry, the calculation of the volume is simplified. This fact was first 
shown by Lobachevsky, who found the volume of an ideal hyperbolic tetrahedron. 
It follows from the definition of an ideal tetrahedron that its opposite dihedral angles 
are pairwise equal (i.e. it has a symmetry). 

Exact formulas for the volumes of spherical octahedra that have mmmz- or 
2|m-symmetry were derived in [3]. The volumes of hyperbolic octahedra with mmm- 
or 3-symmetry were calculated in [4] and [5] respectively. These studies were initi- 
ated by Galiulin, Mikhalev, and Sabitov’s paper [27], in which the authors calculated 
the volumes of Euclidean octahedra with nontrivial symmetry. We show just two 
from the above mentioned results. 
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An octahedron is said to have 2|m-symmetry if it admits a rotation through an 
angle of 21 and a mirror symmetry about a plane orthogonal to the axis of rotation 
(see Figure 11). The 2|m-symmetry of the octahedron corresponds to the group C2 
in the Schoenflies classification, to the group [2,2] in the Coxeter classification, 
or to the group 2* in the orbifold classification (see, e.g., [19], [32]). The volume 
formula for spherical octahedra with 2|m-symmetry is given in [3]. 


N47 


2|m 


Figure 11. An octahedron O = O(A, B,C, D) with 2|m-symmetry 


Theorem 2.17 (Abrosimov, Godoy-Molina, and Mednykh, 2008, [3]). Let O = 
O(A, B,C, D) be a spherical octahedron with dihedral angles A, B,C, D that ad- 
mits a 2|m-symmetry. Then its volume V = V((Q) is given by the formula 


0 
V= 2 | ( arctanh(X cos(t)) + arctanh(Y cos(t)) 
Z + arctanh(Z cos(t)) + arctanh(W cos(r))) ———, 
cos(T) 
where X = cos(A), Y = cos(B), Z = cos ($(C + D)), W = cos ($(C — D)), 
and 0 (0 <0< Zz) is a root of the equation 


X+Y+Z+W 


se 
XYZ+XYW+XZW+YZW 


cos?(0@) + 


Moreover, @ is determined by the sine-tangent rule 


sin(A) _ sin(B) _ sin(5(C + D)) _ sin(5(C — D)) tanita) 
tan(a) tan(b) tan ($(c + d)) tan (4(c —d)) Tenant 
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An octahedron is said to have 3-symmetry if it admits the central symmetry and a 
rotation through an angle of 21 (see Figure 12). The 3-symmetry of the octahedron 
corresponds to the group S¢ in the Schoenflies classification, to the group [2+ , 6+] 
in the Coxeter classification, or to the group 3x in the orbifold classification (see, 
e.g., [19] or [32], Chapter 11). The existence criterion and the volume formula for a 
hyperbolic octahedron with this type of symmetry are given in [5]. 


Figure 12. An octahedron © = O(a, c) with 3-symmetry 


Theorem 2.18 (Abrosimovy, Kudina, and Mednykh, 2015, [5]). Leta,c > 0 be some 
numbers. Then the following statements are equivalent: 


(i) there exists a compact hyperbolic octahedron O(a,c) with edge lengths a,c 
that admits a 3-symmetry; 
(ii) the inequality 
sinh? (4a) 
= a 
sinh (5c) 


holds. 
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Theorem 2.19 (Abrosimov, Kudina, and Mednykh, 2015, [5]). Let O = O(a,c) be 
a compact hyperbolic octahedron with edge lengths a,c that admits a 3-symmetry. 
Then its volume V = V((O) is given by the formula 
a 
V= af t F + (1+ 2cosh(t))cG 
7 (1 + 2cosh(t))A 


where 


F = 1+42cosh(t) + 2cosh?(t) + cosh3(t) 
— 2cosh(c) — 2 cosh(t) cosh(c) — cosh? (c) 
— cosh(t) cosh? (c) + cosh?(t) cosh(c) 
— 4cosh?(t) cosh?(c) + 3 cosh3(c), 


G = sinh(f) sinh(c)(—1 + 2cosh(t)), 


A = (cosh(2c) — cosh(t)) / (—2 cosh(t) + 3 cosh(c))(cosh(t) + cosh(c)). 


The algorithm developed in [5] for obtaining the volume formula is rather univer- 
sal. It can be also applied for other hyperbolic polyhedra with non-trivial symmetry. 

The idea is the following. First, the polyhedron under consideration is modeled in 
Euclidean space. The coordinates of the vertices in IE? are calculated in some param- 
eters according to the symmetry. Then the Euclidean polyhedron is placed into the 
projective Caley—Klein model of hyperbolic space. This gives rise to some additional 
conditions on the coordinates. Vector algebra in projective space is used to express 
hyperbolic edge lengths and dihedral angles in terms of coordinate parameters. Then 
symmetry is used again to exclude non-invariant parameters depending on the choice 
of coordinates. As a result we obtain a direct relation between lengths and angles of 
the hyperbolic polyhedron. These relations allow us to establish an existence crite- 
rion for such polyhedra in H?. Moreover, they allow us to find the solution of the 
Schlafli differential equation responsible for the volume. The latter provides an exact 
integral formula for the volume of the hyperbolic polyhedron. 


Acknowledgement. The research of this article was funded by Russian Science 
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1 Introduction 


A bar-and-joint framework (framework, for brevity) is made of rigid bars connected 
at their ends by universal joints. A framework can be constrained to a plane or allowed 
to move in space. Rigidity of frameworks is a question of practical importance, and 
its mathematical study goes back to the 19" century. Plate-and-hinge structures such 
as polyhedra can be represented by bar-and-joint frameworks through replacement of 
the hinges by bars and rigidifying the plates with the help of diagonals. Thus, rigidity 
questions for polyhedra belong to the same domain. 

There are two ways to approach the rigidity of a framework: through statics, 
i.e. the ability to respond to exterior loads, and through kinematics, i.e. the absence 
of deformations. A framework is called statically rigid if every system of forces 
with zero sum and zero moment can be compensated by stresses in the bars of the 
framework. A framework is called rigid if it cannot be deformed while keeping the 
lengths of all bars, and infinitesimally rigid if it cannot be deformed so that the lengths 
of bars stay constant in the first order. As it turns out, static rigidity is equivalent to 
infinitesimal rigidity. 
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The study of statics has a long history. Systems of forces appear in the textbooks 
of Poinsot [42] and Mobius [37], and the concept of a line-bound force was one of the 
motivations for Grassmann’s introduction of the exterior algebra of a vector space. 

Infinitesimal isometric deformations seem to have appeared first in the context of 
smooth surfaces, see [12] and references therein. In the first half of the 20 century 
the interest in the isometric deformations was stimulated by the Wey] problem, which 
asked whether every positively curved Riemannian metric on the 2-sphere can be 
realized as a smooth convex surface in R*. This was answered in the affirmative 
in the 1950s by Nirenberg and by Alexandrov and Pogorelov. The Weyl problem 
motivated Alexandrov’s works on polyhedra, in particular his enhanced version of 
the Legendre—Cauchy—Dehn rigidity theorem for convex polyhedra. For a survey on 
rigidity of smooth surfaces see [44], [21], [22], and [20]; for rigidity of frameworks 
and polyhedra see [9]. 

The goal of this chapter is to present the fundamental notions and results from the 
rigidity theory of frameworks in Euclidean space and to extend them to hyperbolic 
and spherical geometry. Below we state four main theorems whose proofs are given 
in the subsequent sections. 


Theorem A. A framework in a Euclidean, spherical, or hyperbolic space has equal 
numbers of kinematic and static degrees of freedom. In particular, infinitesimal rigid- 
ity is equivalent to static rigidity. 


The number of static degrees of freedom is the dimension of the vector space of 
unresolvable loads, that is, of those systems of forces applied to the nodes of the 
framework which cannot be compensated by stresses in the bars. (A stress in a bar 
pushes its endpoints apart or pulls them towards each other.) The number of kinematic 
degrees of freedom is the dimension of the vector space of non-trivial infnitesimal 
isometric deformations, that is, of deformations which are not induced by an isometry 
of the ambient space. See Sections 2 and 3 for more detailed definitions and for a 
proof of Theorem A. 


Theorem B (Darboux—Sauer correspondence). The number of degrees of freedom 
of a Euclidean framework is a projective invariant. In particular, a framework is 
infinitesimally rigid if and only if any of its projective images is infinitesimally rigid. 


The projective invariance of static rigidity follows from the interpretation of a 
line-bound vector (a force) in a d-dimensional Euclidean space as a bivector in R¢+1. 
Linear transformations of R@+! preserve static dependences; at the same time they 
generate projective transformations of RP“. See Section 4.1. 
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Theorem C (infinitesimal Pogorelov maps). A hyperbolic or a spherical framework 
has the same number of kinematic degrees of freedom as its geodesic Euclidean im- 
age. In particular, it is infinitesimally rigid if and only if its geodesic Euclidean im- 
age IS. 


By a geodesic Euclidean image of a hyperbolic framework we mean its represen- 
tation in a Beltrami—Cayley—Klein model. A geodesic Euclidean image of a spheri- 
cal framework is its projection from the center of the sphere to an affine hyperplane. 
Every geodesic map of an open region in the hyperbolic or spherical space into Eu- 
clidean space differs from those given above by post-composition with a projective 
map. 

Theorem C is related to Theorem B and is also proved in Section 4.1. In the 
same section we describe the infinitesimal Pogorelov maps that send the static or 
kinematic vector spaces of a framework to the corresponding vector spaces of its 
geodesic image. 

While the previous three theorems hold for frameworks of any combinatorics and 
in spaces of any dimension, the following one is specific to frameworks in dimen- 
sion 2 whose underlying graphs are planar. 


Theorem D (Maxwell—Cremona correspondence). For a framework on the sphere 
or in the Euclidean or hyperbolic plane based on a planar graph, the existence of 
any of the following objects implies the existence of the other two: 


1) a self-stress, 
2) a reciprocal diagram, 


3) a polyhedral lift. 


A self-stress of a framework is a collection of stresses in its bars that resolves a 
zero load. A reciprocal diagram is essentially a framework whose combinatorics is 
dual to the combinatorics of a given framework (nodes correspond to faces, bars to 
bars, and the corresponding bars are perpendicular). Finally, a polyhedral lift in the 
Euclidean case is a vertical lift to R? such that the images of the vertices of every 
face are coplanar. Definitions of reciprocal diagrams and polyhedral lifts slightly 
differ in different geometries. Also, Theorem D has various versions all of which are 
presented in Section 5. 

The theory of isometric deformations extends to the smooth case in a straight- 
forward way (and, as we already mentioned, probably preceded the kinematics of 
frameworks). Accordingly, there are analogs of Theorems B and C for smooth sub- 
manifolds of Euclidean, hyperbolic or spherical space. In fact, Theorem B was proved 
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by Darboux for smooth surfaces and only later by Sauer for frameworks [46]. Also 
Theorem C was first proved by Pogorelov in [41, Chapter 5] for smooth surfaces. 
On the other hand, a theory of statics for smooth surfaces containing an analog of 
Theorem A is not fully developed or at least not widely known. (See however the 
dissertation of Lecornu [31].) 

Let us set up the notation used throughout the chapter. In the following, X% stands 
for either E? (Euclidean space) or $@ (spherical space) or H¢ (hyperbolic space). We 
often view these spaces as subsets of the real vector space R¢*!: 


EB? = {x € R4t! | x9 = 1}, 
S¢ = {x eR | (x,x) = 1), 


H? = {x € R477 | (x, x)a1 =—1, x0 > 0}. 


Here (-, -) is the Euclidean scalar product, and (-, -)¢,; is the Minkowski scalar prod- 
uct: 


(x, Y)da = —XoYo + X1y1 +++ + Xaya- 


Since all formulas involving the scalar products are identical in the spherical and in 
the hyperbolic case, we omit the subscripts and denote by (-,-) both the Euclidean 
and the Minkowski scalar products. Sometimes we also use sinx and cosx to denote 
sin and cos in the spherical and sinh and cosh in the hyperbolic case. 


2 Kinematics of frameworks 
2.1 Motions 


Let T’ be a graph; we denote its vertex set by Ip and its edge set by Ty. For the 
vertices of I we use the symbols i, 7 etc. The edges are unordered pairs of elements 
of To, and for brevity we usually write i7 instead of {i, 7} € Ty. 


Definition 2.1. A framework in X@ is a graph T together with a map 
ply, it > pj 


such that p; # p; whenever {i, j} € Ty. If X? = 94, then we additionally require 
pi # —p; for all {i, 7} € T1. 
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This is a mathematical abstraction of a bar-and-joint framework, see the introduc- 
tion. Note that we allow intersections between the edges. 

In a framework (I, p), every edge receives a non-zero length dist(p;, p;). 
Two frameworks (IT, p) and (IT, p’) with the same graph are called isometric if they 
have the same edge lengths: dist(p;, pj) = dist(p;, Pj) for all {i, 7} € I. Frame- 
works with the same graph are called congruent if there is an ambient isometry 
® € Isom(X) such that PD, = ®(p;) for alli € To. 


Definition 2.2. A framework (I, p) is called globally rigid if every framework iso- 
metric to (IT, p) is also congruent to it. 


An isometric deformation of a framework (I, p) is a continuous family of iso- 
metric frameworks (I, p(t)), where t € (—e,¢€) and p(O) = p. An isometric 
deformation is called trivial if it is generated by a family of ambient isometries: 


pilt) = ®:(pi). 


Definition 2.3. A framework (I, p) is called rigid (or locally rigid) if it has no non- 
trivial isometric deformations. A non-rigid framework is also called flexible. 


Clearly, global rigidity implies rigidity, but not vice versa. See Figure 1. 


Figure 1. Frameworks in the plane. Left: globally rigid. Middle: rigid but not globally rigid. Right: 
flexible. 


2.2 Infinitesimal motions 
Definition 2.4. A vector field on a framework (T, p) is a map 
gis TR, fe 


such that gj € Tp, X@ for all i. A vector field is called an infinitesimal isometric 
deformation of (I, p) if for some (and hence for every) smooth family of frameworks 
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(T, p(t)) such that 


Pi(t) = q@ for alli eT, 
t=0 


d 
p(O) = p, ah 


we have 


dist(pi(t), pj(t)) = 0 


t=0 


for all {i, 7} ET). 
Clearly, the infinitesimal isometry condition is equivalent to 
(qi. iz) + (qj. ei) = 9, (1) 
where ej; € Ty, X@ is the unit vector tangent to the geodesic are pj; pj. 


Lemma 2.5. A vector field q is an infinitesimal isometric deformation of a framework 
(T, p) if and only if 


(Pi — Pj. i-4j) =9 inE®, 
(pi.dj) + (gis pj) =0 in S4 or H4. 


Here (p;,q;) means the Euclidean, respectively Minkowski scalar product in 
IR¢+!, which makes sense if we identify Tp; X@ with a linear subspace of R¢*!. 


Proof. This follows from (1) and 


ee in E4 

lp; — Pill 

aE De in $% and H% 
sing ¢;; , 


where = ¢;; = dist(p;, p;) 


An infinitesimal isometric deformation is called trivial if there is a Killing field 
K on X@ such that g; = K(p;) for all i. 


Definition 2.6. A framework (I, p) is called infinitesimally rigid, if it has no non- 
trivial infinitesimal isometric deformations. 
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Theorem 2.7. An infinitesimally rigid framework is rigid. 


For a proof, see [19], [2], and [8]. 


The converse of Theorem 2.7 is false, see Figure 2. 


Figure 2. A rigid but infinitesimally flexible framework 


Similarly to the example in Figure 2, one can construct a non-trivial infinitesimal 
isometric deformation for every framework contained in a geodesic subspace of X@ 
(provided that the framework has at least d + 1 vertices). This is one of the reasons 
why it is convenient to consider only spanning frameworks: those whose vertices are 
not contained in a geodesic subspace. 


Denote the set of all infinitesimal isometric deformations of a framework (T, p) 
by V(I, p). Due to Lemma 2.5, V(T, p) is a vector space. The set of trivial infinitesi- 
mal isometric deformations is also a vector space; we denote it by Vo(T, p). If (1, p) 
is spanning, then dim Vo(I’, p) = aoneee 


Definition 2.8. The dimension of the quotient space V(T, p)/Vo(I, p) is called the 
number of kinematic degrees of freedom of a framework (I, p). 


In particular, infinitesimally rigid frameworks are those with zero kinematic de- 
grees of freedom. 


Remark 2.9, Determining whether a framework is flexible is more difficult than 
determining whether it is infinitesimally flexible: the latter is a linear problem, the 
former is an algebraic one. Examples of Bricard octahedra and Kokotsakis polyhedra 
in Section 2.7 illustrate this. 
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2.3 Point-line frameworks 


A point-line framework in R? associates to every vertex i of I either a point p; or a 
line /; in R?. The edges of I correspond to the constraints of the form 


dist(pi, pj) = dist(p;, p;), dist(pi, 1;) = dist(p;,1;), 2Ui,1j)) = ZU, 0;). 2) 


For recent works on point-line frameworks see [27] and [15]. 

In spherical geometry, a point-line framework is equivalent to a standard frame- 
work. If we replace every great circle by one of its poles, then the last two constraints 
in (2) take the form of the first one. 

In hyperbolic geometry, the pole of a line is a point in the de Sitter plane (the 
complement of the disk in the projective model of H?). Therefore the study of point- 
line frameworks in H? can be reduced to the study of standard frameworks in the 
hyperbolic-de Sitter plane. Moreover, we can allow ideal points, which means as- 
signing horocycles to some of the vertices of I’ and fixing the point-horocycle, line- 
horocycle and horocycle-horocycle distances. 


2.4 Constraints counting 


One can estimate the dimension of the space of non-congruent realizations of a frame- 
work by counting the constraints. If |[9| = and || = m, then there are m equa- 
tions on dn vertex coordinates. Besides, one has to subtract the dimension of the 
space of trivial motions, which is eer) for spanning frameworks. Thus, generi- 
cally a framework in X@ with n vertices and m edges has dn — m — ater) degrees 
of freedom. 

Of course, the above arithmetics does not make much sense without the combi- 
natorics (we can put a lot of edges on a subset of the vertices, allowing the other 
vertices to fly away). Laman [30] has shown that in dimension 2 the arithmetics and 
combinatorics suffice to characterize generic rigidity. A graph I" is called a Laman 
graph if |[| = 2|Co| — 3 and every induced subgraph of I with k vertices has at 
most 2k — 3 edges. 


Theorem 2.10. A Laman graph is generically rigid, that is, the framework (I, p) is 
rigid for almost all p. 


No analog of the Laman condition is known for frameworks in higher dimensions. 
See [10] for more details on generic rigidity. 
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If all faces of a 3-dimensional polyhedron homeomorphic to a ball are triangles, 
then its graph satisfies |I',| = 3|I9|—6, that is, the above count gives 0 as an estimate 
for degrees of freedom. Rigidity of polyhedra is discussed in the next section. 


2.5 Frameworks and polyhedra 


One may try to generalize bar-and-joint frameworks by introducing panel-and-hinge 
structures: rigid polygons sharing pairs of sides and allowed to freely rotate around 
these sides, or even more generally n-dimensional “panels” rotating around (” — 1)- 
dimensional “hinges.” A mathematical model for such an object is called a poly- 
hedron or a polyhedral complex. However, there is a way to replace a polyhedral 
complex by a framework without changing its isometric deformations (global as well 
as local and infinitesimal). For this, replace every panel by a complete graph on its 
vertex set. This “rigidifies” the panels and leaves them the freedom to rotate around 
the hinges. 


A particular class of polyhedral complexes is the one of convex polyhedra. Ac- 
cording to the Legendre—Cauchy theorem (see [32] and [7]), a convex polyhedron 
is globally rigid among convex polyhedra. There are simple examples of convex 
polyhedra isometric to non-convex ones. By the Dehn theorem [14] (that can also 
be proved by the Legendre—Cauchy argument), convex 3-dimensional polyhedra are 
infinitesimally rigid. 

The Legendre—Cauchy argument applies to spherical and hyperbolic convex poly- 
hedra as well. This allows to prove the rigidity of convex polyhedra in X@ for d > 3 
by induction: the link of a vertex of a d-dimensional convex polyhedron is a (d — 1)- 
dimensional spherical polyhedron, and the rigidity of links implies the rigidity of the 
polyhedron. 


A simplicial polyhedron (that is, one all of whose faces are simplices) has the 
same kinematic properties as its 1-skeleton. In a convex non-simplicial polyhedron 
we can replace every face by a complete graph as described in the first paragraph; 
but in fact a much “lighter” framework is enough to keep the polyhedron rigid. It 
suffices to triangulate every 2-dimensional face in an arbitrary way (without adding 
new vertices in the interior of the face, vertices on the edges are all right). Again, the 
Legendre—Cauchy argument ensures the rigidity of all 3-dimensional faces, and the 
induction applies as in the previous paragraph, see [1, Chapter 10] and [52]. 

As already indicated, the cone over a framework in $4 can be viewed as a panel 
structure (or a framework) in E¢+!, Similarly, a framework in H?@ leads to a frame- 
work in the (d + 1)-dimensional Minkowski space. 
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2.6 Averaging and deaveraging 


There is an elegant relation between the infinitesimal and global flexibility. (For 
smooth surfaces, this idea goes back to the 19""th century.) 


Theorem 2.11. 1) DEAVERAGING. Let (I, p) be a framework in X@ with a non- 
trivial infinitesimal isometric deformation q. Define two new frameworks (T, p*) 
and (I, p_) as follows. 


Pi=Pitd, Pi =Pi-di forX=E, 
prece = s . age Lay for X =SorH, 
lpi + gill Il pi — ill 
where ||v|| = /|(v, v)|, and in the hyperbolic case p; + q; are assumed to be light- 


like. 
Then the frameworks (I, p+) and (YT, p~) are isometric, but not congruent. 


2) AVERAGING. Let (I, p’) and (T, p”) be two isometric non-congruent frame- 
works in X@. Put 


pi + py p. — py 
ae aa a ae forX=E, 
P44. pl Ppl 
peo Ei , Peas al ae ae , forX=SorHl. 
IP; + P; I IP; + Pp; I 


Then q is a non-trivial infinitesimal isometric deformation of (T, p). 


In the deaveraging procedure it might happen that pi = PF for some {i, 7} € Ty, 
so that p* is not a framework. To avoid this, one can replace q by cq for a generic 
c € R. In the hyperbolic case, choosing c sufficiently small will make p; + cq; 
light-like. 


Proof. Formulas of averaging are inverses to those of deaveraging, and both state- 
ments can be proved by a direct calculation. Note that in the spherical and the hy- 
perbolic cases we have || p; + gil] = || pi — qi|| due to (p;, g;) = 0. Also g is non- 
trivial if and only if it changes the distance in the first order between some p; and pj; 
not connected by an edge. One can check that this is equivalent to dist( Dt F P;) a 


dist(p; , D; )- 
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2.7 Examples 


In Section 2.4 we spoke about generically rigid graphs. The most interesting exam- 
ples of flexible frameworks are special realizations of generically rigid graphs. 


Example 2.12 (a planar framework with 9 edges on 6 vertices). The frameworks 
on Figure 3 (which are combinatorially equivalent) are infinitesimally flexible if and 
only if the lines a, b, c are concurrent, that is, meeting at a point or parallel. This can 
be proved with the help of the Maxwell—Cremona correspondence, see Example 5.4. 


Cc 
. 
< 
. 
. 
rs 
7 7 
ae Oe 
ear ~N 
Se, SS 
~ > 
------- -O 


Figure 3. Infinitesimally flexible frameworks in the plane 


Example 2.13 (another planar framework with 9 edges on 6 vertices). A framework 
based on the bipartite graph K3,3 is infinitesimally flexible if and only if its vertices 
lie on a (possibly degenerate) conic, see Figure 4. For a proof see [5] and [51]. 
On the left-hand side, the vertices lie on a circle; the arrows indicate a non-trivial 
infinitesimal isometric deformation. 


i x 


Figure 4. Infinitesimally flexible frameworks in the plane 
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The conditions in the above two examples are projectively invariant. Besides, 
the same criteria hold for frameworks on the sphere or in the projective plane. (Three 
lines in H? are called concurrent if they meet at a hyperbolic, ideal, or de Sitter point.) 
A non-Euclidean conic is one that is depicted by an affine conic in a geodesic model 
of $7 or H?, see [23]. 


Example 2.14 (Bricard’s octahedra and Gaifullin’s cross-polytopes). Flexible oc- 
tahedra (with intersecting faces) were discovered and classified by Bricard [6], see 
also [49] and [38]. A higher-dimensional analog of the octahedron is called cross- 
polytope. Recently, flexible cross-polytopes in X@ were classified by Gaifullin [18]. 


Example 2.15 (infinitesimally flexible octahedra). While the description and classi- 
fication of flexible octahedra requires quite some work, infinitesimally flexible octa- 
hedra can be described in a simple and elegant way. 


Color the faces of an octahedron white and black so that adjacent faces 
receive different colors. An octahedron is infinitesimally flexible if and 
only if the planes of its four white faces meet at a point (which may lie 
at infinity). As a consequence, the planes of the white faces meet if and 
only if those of the black faces do. 


This theorem was proved independently by Blaschke and Liebmann [4] and [33]. 
The configuration is related to the so-called Mobius tetrahedra: a pair of mutually 
inscribed tetrahedra, [36]. 

Figure 5 shows two examples of infinitesimally flexible octahedra. The one on 
the left is a special case of the Schoenhardt octahedron [47]; its bases are regular 
triangles, and the orthogonal projection of one base to the other makes the triangles 
concentric with pairwise perpendicular edges. 


Theorem B implies that infinitesimally flexible octahedra in $7 and H? are char- 
acterized by the same criterion as those in E?. In hyperbolic space, the intersection 
point of four planes may be ideal or hyperideal. In fact, even the vertices of an oc- 
tahedron may be ideal or hyperideal. Infinitesimally flexible hyperbolic octahedra 
were used in [25] to construct simple examples of infinitesimally flexible hyperbolic 
cone-manifolds. 


Example 2.16 (Jessen’s icosahedron and its relatives). In the xy-plane of R?3, take 
the rectangle with vertices (+1, +, t,0), where 0 < t¢ < 1. Take two other rectangles, 
obtained from this one by 120° and 240° rotations around the x = y = z line (which 
results in cyclic permutations of the coordinates). The convex hull of the twelve 


V5-1 
2 


vertices of these rectangles is an icosahedron (a regular one for tf = 
the edges of this icosahedron are the short sides of the rectangles. 


). Among 
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Cc 


Figure 5. Infinitesimally flexible octahedra. On the right, the points A, B, C, D must be coplanar. 


Modify the 1-skeleton by removing the short sides of rectangles (like the one join- 
ing (1, t, 0) with (1, —7, 0)) and inserting the long sides (like the one joining (1, t, 0) 
with (—1,7,0)). The resulting framework p(t) is the 1-skeleton of a non-convex 
icosahedron. Jessen [28] gave the t = ; non-convex icosahedron as an example of 
a closed polyhedron with orthogonal pairs of adjacent faces, but different from the 
cube. See Figure 6. 

The framework p(t) has two sorts of edges: the long sides of the rectangles, 
which have length 2, and the sides of eight equilateral triangles, which have length 
J 2(t? —t + 1). It follows that the frameworks p(t) and p(1—1) are isometric. Note 
that p(0) collapses to an octahedron: the map p(t): [9 —> R? sends the vertices of the 
icosahedral graph to the vertices of a regular octahedron by identifying them in pairs; 
there are three pairs of edges that are mapped to three diagonals of the octahedron. At 
the same time, p(1) is the graph of the cuboctahedron with square faces subdivided 
in a certain way. 

Since the average of p(t) and p(1 — fr) (in the sense of Section 2.6) is p(3), it 
follows that Jessen’s icosahedron is infinitesimally flexible. 

Theorem B implies that there are spherical and hyperbolic analogs of this con- 
struction. 


Example 2.17 (Kokotsakis polyhedra). A Kokotsakis polyhedron with an n-gonal 
base is a panel structure made of a rigid n-gon, and n quadrilaterals attached to its 
edges, and n triangles attached between the quadrilaterals, see Figure 7, left. Gener- 
ically, a Kokotsakis polyhedron is rigid; it is flexible for certain symmetric configu- 
rations, see Figure 7, right. 
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Figure 6. Jessen’s orthogonal and infinitesimally flexible icosahedron 


Especially interesting are the polyhedra with a quadrangular base, because of 
their relation to quad-surfaces (polyhedral surfaces made of quadrilaterals with four 
quadrilaterals around each vertex). A quad-surface is (infinitesimally) flexible if and 
only if all Kokotsakis polyhedra around its faces are. A famous example of a flexible 
quad-surface is the Miura-ori [39]. 

A characterization of infinitesimally flexible Kokotsakis polyhedra was given by 
Kokotsakis in [29]; several flexible examples were constructed in [45] and [29]. 
A complete classification of flexible polyhedra with a quadrangular base is given 
in [26]. 


3 Statics of frameworks 
3.1 Euclidean statics 


In the statics of a rigid body, a force is represented as a line-bound vector: moving 
the force vector along the line it spans does not change its action on a rigid body. 
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Figure 7. Kokotsakis polyhedra 


Definition 3.1. A force in a Euclidean space is a pair (p, f) with p € EY, f € R?. 
A system of forces is a formal sum )°; (pi, f;) that may be transformed according to 
the following rules. 


0) A force with a zero vector is a zero force: 
(p,0) ~ 0. 
1) Forces at the same point can be added and scaled as usual: 
Ai(p, fi) + A2(p, f2) ~ (DAL Ai + A2f2). 
2) A force may be moved along its line of action: 


(pv. f)~(pt+AF f). 


One may check from this definition that systems of forces form a vector space of 
dimension ae 

In I, any system of forces is equivalent either to a single force or to a so called 
“force couple” (pi, f) + (p2,—f), where the vector f is not parallel to the line 
through p; and po. 


Definition 3.2. A load on a Euclidean framework (I, p) is a map 


ft Re, 
it> fj. 


A load is called an equilibrium load if the system of forces )0; ep 0 (pi, fi) is equiv- 
alent to a zero force. 
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A rigid body responds to an equilibrium load by interior stresses that cancel the 
forces of the load. This motivates the following definition. 


Definition 3.3. A stress on a framework (I, p) is a map 


w:T,; —R, 
ij -> Wij = Wii. 


The stress w is said to resolve the load f if 


fi= > wij(pj — pi) foralli €To, (3) 
JEYo 


where we put w;; = 0 for all ij ¢ 1). 


We denote the vector space of equilibrium loads by F(T, p), and the vector space 
of resolvable loads by Fo(T, p). It is easy to see that every resolvable load is an 
equilibrium load: Fo(T, p) C F(L, p). 


Definition 3.4. The dimension of the quotient space F(T, p)/Fo(T, p) is called the 
number of static degrees of freedom of the framework (T, p). 

The framework (I, p) is called statically rigid if it has zero static degrees of 
freedom, i.e. if every equilibrium load can be resolved. 


3.2 Non-Euclidean statics 


Definition 3.5. Let X? = $¢ or H@. A force in X@ is an element of the tangent 
bundle 7X4. We write it as a pair (p, f) with p € X@ and f € TMs 

A system of forces is a formal sum of forces that may be transformed according to 
the rules of Definition 3.1, where the formula in the rule 2) is replaced by (p, f) ~ 
(exp, (Af), t(f)) with t(f) being the result of the parallel transport of f along the 
geodesic from p to exp, (Af). 


A system of forces on $7 is always equivalent to a single force; a system of forces 
on H? is equivalent to either a single force, or an ideal force couple or a hyperideal 
force couple. 


Definition 3.6. A load on a framework (I, p) in X@ is a map 
fit S. jer. 


A load is called an equilibrium load if the system of forces )0; ep  (Pi» fi) is equiv- 
alent to a zero force. 
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In the above definitions, X@ can also stand for E%. The canonical isomorphisms 
T,E¢ = R?@ result in simplified formulations given in the preceding section. 

As in the Euclidean case, a stress on a framework in X4 isa map w:T; > R. 
A stress w resolves a load f if 


fi= Do wy lier. 
JET 


The following lemma gives an alternative description of the stress resolution. 


Lemma 3.7. A stress w resolves a load f on a framework (T, p) in X4 = S4 or 
H?¢ if and only if for every i € V9 we have 
fi— Do day vy Il Bi. 
JETo 
where 
Foes poe 
a " siny €;; 


Here fi, pi € R4*! via X? C R4*!, and v || w means that the vectors v and w are 
linearly dependent. 


Proof. Follows from the formula for e;; given in the proof of Lemma 2.5. 


Pj — cosx dist(pi, pj) Pi = Pj — (Pi. Pj) Pi = Sinx dist(p;, pj) - eij. 


3.3 Equivalence of static and infinitesimal rigidity 


Define a pairing between vector fields and loads on a framework (T, p): 


at= > Gah): (4) 


i€To 


This pairing is non-degenerate and therefore induces a duality between the space of 
vector fields and the space of loads. 


Lemma 3.8 (principles of virtual work). Under the pairing (4), 


1) the space of infinitesimal motions is the annihilator of the space of resolvable 
loads: 
VI, p) = Fo(T. p)*: 
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2) the space of trivial infinitesimal motions is the annihilator of the space of equi- 
librium loads: 


Vo(T, P) = F(T, Pp)’. 


A proof in the Euclidean case can be found in [24]; it transfers to the spherical 
and the hypebolic cases. 


As a consequence, the pairing (4) induces an isomorphism 


VT. p)/ Voll, p) = (FL, p)/Fo(T, p))” (5) 


which implies Theorem A. 


The statics of a Euclidean framework is formulated in purely linear terms: loads 
and stresses on a framework correspond to loads and stresses on its affine image. 
Together with Theorem A this leads to the following conclusion, which is a special 
case of Theorem B. 


Corollary 3.9. The number of kinematic degrees of freedom of a Euclidean frame- 
work is an affine invariant. In particular, an affine image of an infinitesimally rigid 
framework is infinitesimally rigid. 


Definition 3.10. The rigidity matrix of a Euclidean framework (T, p) is a Ty x To 
matrix with vector entries: 


RV, p) = ii] + Pi — Dj 


It has the pattern of the edge-vertex incidence matrix of the graph I’, with p; — p; 
on the intersection of the row ij and the column /. 


The rows of R(T’, p) span the space Fo(I’, p). The following proposition is a 
reformulation of the first principle of virtual work. 
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Lemma 3.11. Consider R(T, p) as the matrix of a map (R42) — R!"!. Then the 
following holds: 


ker R(T, p) = VC, p); 


im R(T, p)' = Fo(L, p). 


Corollary 3.12. A framework (I, p) is infinitesimally rigid if and only if 


‘oe 
rk R(P. p) = d [Tol = ( . ). 


4 Projective statics and kinematics 
4.1 Projective statics 
For X¢ = E24, $? or H4, associate to a force (p, f) in X@ a bivector in R@+!: 


(p, f)-> paf. (6) 


We use the canonical embeddings X? Cc R¢+! that allow to view a point p anda 
vector f as vectors in R¢*!. 


Lemma 4.1. The map (6) extends to an isomorphism between the space of systems 
of forces on X@ and the second exterior power AACReT"), 


The equivalence relations from Definition 3.1 ensure that a linear extension is 
well-defined. For a proof of its bijectivity, see [24]. 
The above observation motivates the following definitions. 


Definition 4.2. A projective framework is a graph I’ together with a map 
m:T9 —> RP?, iron, 
such that 7; A m; forij € 1). 
We say that @ € A2(IR¢*!) is divisible by a vector v if ¢ = v A w for some 


vector w. Similarly, we say that @ is divisible by x € RP? if ¢ is divisible by a 
representative of z. 
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Definition 4.3. A load on a projective framework (I, 7) is a map 
¢:T%y9 — A2(R4t1), i> gi, 


that sends every vertex i to a bivector divisible by 2;. An equilibrium load is one 


that satisfies 
3 gi = 0. 


i€To 


Definition 4.4. Denote by I") the set of oriented edges of the graph I’. A stress on 
a projective framework (I, 7) is a map 


OTe —s ARE Gye we 


such that @;; is divisible by both x; and z;, and aj; = —@;j. 
A stress Q is said to resolve a load ¢ if 


gi = » Wij. 


JETo 


The projectivization of a framework (I, p) in X@ is obtained by composing p 
with the inclusion X? C R@+! and the projection R¢+!\{0} — RP2. The following 
lemma is straightforward. 


Lemma 4.5. The map (6) sends bijectively the equilibrium, respectively resolvable, 
loads on a framework in X@ to the equilibrium, respectively resolvable, loads on its 
projectivization. 


Theorems B and C are immediate corollaries of Lemma 4.5. 


Proof of Theorem B. Two frameworks in E® are projective images of one another 
if and only if their projectivizations are related by a linear isomorphism of R¢+?. 
A linear map sends equilibrium loads to equilibrium ones, and resolvable to resolv- 
able ones. 


It seems that Theorem B was first proved by Rankine [43] in 1863. He stated 
that static rigidity is a projective invariant but did not give the details, just saying 
that “[...] theorems discovered by Mr. Sylvester [...] obviously give at once the 
solution of the question.” The first detailed accounts are [33] (for a special case 
[P1| = d|Fo| — “S*) and [46]. 


Proof of Theorem C. A Euclidean framework and its geodesic spherical or hyper- 
bolic image have the same projectivizations. Hence the maps (6) yield an isomor- 
phism between the spaces of their equilibrium/resolvable loads. 
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4.2 Static and kinematic Pogorelov maps 


Let a framework (I, p) in E®% and a projective map ®: RP? — RP? be given such 
that the image of ® o p is contained in E?. (Here RP®@ is a projective completion 
of E7). Lemma 4.5 does not only show that the spaces of equilibrium modulo re- 
solvable loads of (I, p) and (1, ®o p) have the same dimension, but also establishes 
a canonical up to a scalar factor isomorphism between these spaces. Through the 
static-kinematic duality from Section 3.3 this also yields an isomorphism between 
the spaces of infinitesimally isometric modulo trivial motions. 

The situation is similar with the geodesic correspondence between frameworks in 
different geometries. The kinematic isomorphisms were described by Pogorelov in 
[41, Chapter 5] together with the maps that associate to a pair of isometric polyhedra 
in one geometry a pair of isometric polyhedra with the same combinatorics in the 
other geometry (this is related to the kinematic isomorphism via the averaging pro- 
cedure, see Section 2.6). We will use the term Pogorelov maps in each of the above 
situations. 


Definition 4.6. Let ¥ c X¢ andY c Y4,X,Ye (IE, 5, H}, and let ®: X¥ — Y bea 
geodesic map. A fiberwise linear map ®**': TX — TY with ®*'(7,X) C Tap) Y 
is called a static Pogorelov map associated with ® if for every framework (T, p) in 
X the following two conditions are satisfied: 


* aload f on (I, p) is in equilibrium if and only if the load ®°*' o f on the 
framework (I, ® o p) is in equilibrium; 


* aload f on (I, p) is resolvable if and only if the load ®*'*'o f on the framework 
(1, ® o p) is resolvable. 


A fiberwise linear map ®*": TX — TY with ®*"(7,X) C Tacpy¥ is called a 
kinematic Pogorelov map associated with ® if for every framework (I, p) in X the 
following two conditions are satisfied: 


* avector field g on (I, p) is an infinitesimal isometric deformation if and only if 
the vector field ®* og on (I, ®o p) is an infinitesimal isometric deformation; 


* a vector field g on (I, p) is a trivial infinitesimal isometric deformation if and 
only if the vector field b' 0 g on (I, ®o p) is a trivial infinitesimal isometric 
deformation. 


Remark 4.7. The last condition on a kinematic Pogorelov map means that ®* sends 
Killing fields on X to Killing fields on Y. For an intrinsic approach to the Pogorelov 
maps defined for Riemannian metrics with the same geodesics, see [16, Section 4.3]. 
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Lemma 4.8. If *'*' is a static Pogorelov map associated with ®, then ((®*‘)~!)* 
is a kinematic Pogorelov map associated with ®. 


Proof. This follows from 


(((B*)71)* (gq), O()) = (g, (OM)? 0 OG) = (Gg, f) 


and from Lemma 3.8. 


4.3 Pogorelov maps for affine and projective transformations 


Theorem 4.9, Let ®:E¢ — E® be an affine transformation with the linear part 
A=d®eGL(n,R). Then 


stat -—A opkin = Cm 
are static and kinematic Pogorelov maps for ®. 


Proof. The equivalence relation in Definition 3.1 is affinely invariant. Therefore f is 
an equilibrium load on (I, p) if and only if Ao f is an equilibrium load on (T, ®o p). 
When (I, p) is transformed by ®, the right-hand side of (3) is transformed by A. 
Therefore a stress that resolves f also resolves A o f. 


Theorem 4.10. Let 
oh? i Be 
be a projective transformation, where L is the hyperplane sent to infinity, and L' 


the image of the hyperplane at infinity. Denote by hi (p) the distance from a point 
pe E¢ to the hyperplane L. Then 


DS = hz (p)-d®,, OF" = h,*(p)- ((d®p)*) * 
are static and kinematic Pogorelov maps for ®. 


Proof. A projective map © consists of a linear transformation M € GL(d + 1, R) 
restricted to E? followed by the central projection from the origin to E¢. We need to 
compose the map (6) with M,: A2(IR¢2+!) — A?(R¢+!) and then with the inverse 
of (6). 
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The map (6) followed by M, transforms a force (p, v) as follows: 
(p,v) > pAV= pA(ptv)t> M(p)AM(p+v). 
We have 


hut)(M (p)) 


M(p) = TB n M(B4), ML) 


O(p) =c-hz(p): P(p) 


for some c € R, where the distances are taken with a sign, see Figure 8. It follows 
that 


M(p) \ M(p + v) =c?-hy(p)-hy(p + v)- (p) A O(p + v). 


M(E*) 


M(L) 0 


Figure 8. Computing the Pogorelov map for a projective transformation 


Applying the inverse of (6) we see that the vector v at p is transformed into the 
vector 


c? hy (p)-hr(p + v)- (®(p + v) — ®(p)) 


at ®(p). By construction, this transformation is linear in v. Therefore it does not 
change if we replace v by tv and take the derivative with respect to ¢ at t = 0. This 
derivative equals c*hy (p)d®,(v). This proves the formula for ®**'. The formula 
for &" follows from Lemma 4.8. 
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4.4 Pogorelov maps for geodesic projections of $7 and H?@ 


Theorem 4.11. Let G:E? — X be the projection from the origin of R¢+, where 
X= ox =H. 

Then the Pogorelov maps for a Euclidean framework (I, p) and its spherical, 
respectively hyperbolic, image (I, G o p) are given by 


1 


Gitat = Wal -dGp. Gkin = 
p IIp Il 


*\—1 
‘ (dGs). 


Here ||- || = 


(.-)]. 


Note that in the spherical case at the point eo (the tangency point of X with E7) 
we have G3" = dGey. 


Proof. To compute the image of v € TE under the differential dG), project the 
geodesic p + tv in E% to X. Then d G,(v) is the velocity vector of the projected 
curve at tf = 0, see Figure 9, top, that illustrates the case of the sphere. On the other 
hand, the image of v under the static Pogorelov map is determined by 


G(p) A Gy" (v) = pv. 


Hence both dG, (v) and Go" (v) are linear combinations of p and v tangent to $4. 
It follows that these two vectors are collinear: 


Go (v) =AX(p,v)-dGp(v), A(p,v) ER. 


If the images of every vector under two linear maps are collinear, then these maps 
are scalar multiples of each other. Thus A depends on p only: 


Gy = A(p)-dGp. 
For small ft, the ratio of the areas of the triangles on Figure 9, top, is equal to || p||. 


Hence 


1 
G(p) \ dGp(v) = pl? Av, 


which implies the first formula of the theorem. The second formula follows from the 
duality between infinitesimal deformations and loads. 
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Figure 9. Computing the Pogorelov map for a geodesic projection to the sphere 
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5 Maxwell—Cremona correspodence 


5.1 Planar 3-connected graphs, polyhedra, and duality 


A graph is called 3-connected if it is connected, has at least 4 vertices, and remains 
connected after removal of any two of its vertices. In particular, every vertex of a 
3-connected graph has degree at least 3. 


Planar 3-connected graphs have very nice properties. First, by a result of Whit- 
ney [54], their embeddings into $? split in two isotopy classes that differ by an 
orientation-reversing diffeomorphism of $?. Second, by the Steinitz theorem (see [50] 
and [55]), a graph is planar and 3-connected if and only if it is isomorphic to the skele- 
ton of some convex 3-dimensional polyhedron. Whitney’s theorem implies that for a 
planar 3-connected graph T there is a well-defined set of faces ['2. Geometrically, a 
face is a connected component of $? \ #(I'), where ¢ is an embedding of I’; combi- 
natorially it is the set of vertices on the boundary of such a component. We call (a, 7) 
with a € 12,7 € [9 andi € aw an incident pair. The choice of an isotopy class of an 
embedding T — $7 and of an orientation of $ induces a cyclic order on the set of 
vertices incident to a face. 


The dual graph Y* of a planar 3-connected graph T' can be constructed from an 
embedding [ — $7 by choosing a point inside every face and joining every pair 
of points whose corresponding faces share an edge. The graph I* is also planar 
and 3-connected, and its dual is again I’. If an edge ij of I’ separates the faces a 
and £, then we say that (a@B,ij) is a dual pair of edges. Choose an isotopy class of 
embeddings I —> $? and fix an orientation of $7. Then we say that the pair (af, i) 
is consistently oriented if the face a lies on the right from the edge ij directed from 
i to j, see Figure 10. Changing the order of i and j or of w and 6 transforms an 
inconsistently oriented pair into a consistently oriented one. 


Figure 10. A consistently oriented dual pair 
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5.2 Maxwell—Cremona theorem 
For convenience we identify in this section E? with R? by choosing an origin. 


Definition 5.1. Let (I, p) be a framework in R? with a planar 3-connected graph I. 
A reciprocal diagram for (T, p) is a framework (['*,m) such that dual edges are 
perpendicular to each other: 


Mp —-Maq L pj — Di 


whenever the edge ij of I separates the faces a and £. 


Definition 5.2. Let (IT, p) be a framework in R? with a planar 3-connected graph 
T and such that for every face a € [2 the points {p; | i € a} are not collinear. 
A vertical polyhedral lift of (T, p) is a map p: 9 — R? such that 


1) pry op = p, where pr, : R? > R? is the orthogonal projection; 
2) for every face a of (IT, p) the points {p; | i € a} are coplanar; 
3) the planes of the adjacent faces differ from each other. 


A radial polyhedral lift of (T, p) is a map p: 9 — R? that satisfies the above con- 
ditions with 1) replaced by 
1’) pr, op = p, where pr,: R? \ {a} > R? is the radial projection from a point 
a ¢ R?. 


It turns out that reciprocal diagrams are related to polyhedral lifts and to the statics 
of the framework (T, p). 

A stress w: Ty — Rona framework (T, p) is called a self-stress if it resolves the 
zero load: 


ye wij (pj — pi) = 0 for alli € To. (7) 
JETo 


Theorem 5.3. Let (I, p) be a framework in R* with a planar 3-connected graph T 
and such that for every face a € T2 the points {p; | i € a} are not collinear. Then 
the following conditions are equivalent: 


1) the framework has a self-stress that is non-zero on all edges; 
2) the framework has a reciprocal diagram; 
3) the framework has a vertical polyhedral lift; 


4) the framework has a radial polyhedral lift. 
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Proof. 1) => 2) From a self-stress w construct a reciprocal diagram ([‘*,m) in 
the following recursive way. Take any face a and define mg, € R? arbitrarily. If for 
some face @ the point mg, is already defined, then for every 6 adjacent to a put 


Mg = Mg + wij J (pj — Pi); 


where J: R? — R? is the rotation by the angle 5, ij is the edge dual to a, and the 
pair (wB, ij) is consistently oriented. In order to show that this gives a well-defined 
map m:T, > R2, we need to check that the sum vi wi; J (p; — pi) vanishes along 
every closed path in the graph *. Viewed as a simplicial chain, every closed path 
is a sum of paths around vertices. The sum around a vertex vanishes due to (7). By 
construction, mg — mg 1 pj; — pj and mg # mg for o and f adjacent in I*, thus 
(I'*, m) is a reciprocal diagram for (T, p). 


2) => 1) Let (af, i/) be a consistently oriented dual pair. Since mg — mg L 
Pj — Pi. there is w;; € Rsuch that mg —ma = wij J(p;j — pi). The map w:T') > R 
thus constructed never vanishes and satisfies (7). 


3) => 2) Given a polyhedral lift of (I, p), let My C R? be the plane to which 
the face a is lifted. Since Mg is not vertical, it is the graph of a linear function 
fu: R* > R. Put my = grad fy. For every dual pair (a, ij) we have 


Di» Pj Ee Man Msg. 


This implies that the linear function fg — fg vanishes along the line through p; and 
p;, hence 


Ma — mp = grad( fa — fg) L pi — pj. 


2) = > 3) Givenareciprocal diagram (I*, m), construct a polyhedral lift recur- 
sively. Take any a and let fy): IR? > Rbe any linear function with grad fo, = Mag. 
If fy is defined for some a, then define fg for every 6 adjacent to a by requiring 


grad fg =mpg, fp — fo = 0on the line p; p;, 


where ij is the edge dual to wf. These conditions are consistent due to mg — mg L 
Pj — pi- In order to check that the recursion is well-defined, it suffices to show that if 
we start with some fq and apply the recursion around the vertex 7 € a, then the new 
Fu will be the same as the old one. This is indeed the case because by construction 
all fg with i € B take the same value at p;. A polyhedral lift of (I’, p) is obtained 
by putting p; = fy(p;) for any a 3 i. 
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3) <> 4) Consider R? as an affine chart of RP. There is a projective transfor- 
mation ®: RP? —> RP? that restricts to the identity on R? C R? and sends the point 
a to the point at infinity that corresponds to the pencil of lines perpendicular to R?. 
(This transformation exchanges the plane at infinity with the plane through a parallel 
to R?.) We have pr, = pr, o®. Therefore if p is a radial polyhedral lift of p, then 
® o pis a vertical lift of p. Conversely, if p is a vertical lift such that p; does not lie 
on the plane through a parallel to R?, then ®~! o # is a radial lift. Any orthogonal 
lift can be shifted in the direction orthogonal to R? so that its vertices do not lie on 
the plane through a parallel to R?. Therefore the existence of an orthogonal lift is 
equivalent to the existence of a radial lift. 


Example 5.4. The Maxwell—Cremona correspondence allows to prove the rigidity 
criterium for the framework from Example 2.12. The lines a, b, c are concurrent if 
and only if the framework has a vertical lift, see Figure 11. 


Figure 11. A vertical lift of the framework from Example 2.12 


Remark 5.5. The spaces of self-stresses, reciprocal diagrams, and polyhedral lifts 
have natural linear structures. The correspondences described in the proof of Theo- 
rem 5.3 are linear, see also [11]. 


Every graph I has a geometric realization |['|: assign to the vertices points in 
R? in general position, and to the edges the segments between those points. A map 
To — R? can be extended to a map |I'| > R? by affine interpolation. We call this 
the rectilinear extension. If the rectilinear extension is an embedding, then every face 
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of I’ (viewed as a cycle of edges) becomes a polygon. In this case there is one face 
that is the union of all the other faces; we call it the exterior face (the term comes 
from the identification of R? with a punctured sphere). The edges of the exterior face 
are called boundary edges, all of the other edges are called interior edges. 


Theorem 5.6. Let (I, p) be a framework in R? with a planar 3-connected graph T 
and such that the rectilinear extension of p to |C| provides an embedding of V into 
R? with convex faces. Then the following conditions are equivalent: 


1) the framework has a self-stress that is positive on all interior edges and negative 
on all boundary edges; 


2) the framework has a reciprocal diagram such that for every consistently ori- 
ented dual pair (aB,ij) the pair of vectors (pj — pi, mg — Mg) is positively 
oriented if ij is an interior edge and negatively oriented if ij is a boundary 
edge; 


3) the framework has a vertical polyhedral lift to a convex polytope; 


4) the framework has a radial polyhedral lift to a convex polytope. 


Proof. It suffices to show that the constructions in the proof of Theorem 5.3 respect 
the above properties. 


1) <> 2) Since a self-stress is related to a reciprocal diagram by the formula 
mg —Maq = wij J(p; — pi), the pair (pj — pi,mg — Ma) is positively oriented if 
and only if w;; > 0. 

2) <> 3) Since mg — mag = grad( fg — fa), the pairs (pj — pi, mg — ma) for 
all interior edges ij are positively oriented if and only if the piecewise linear function 
over the union of the interior faces defined by f(x) = fy(x) for x € a@ is convex. 
The graph of this function together with the lift of the exterior face (that covers the 
union of the interior faces) form a convex polytope. 


3) < > A) The projective image of a convex polytope (provided no point is sent 
to infinity) is a convex polytope. The orthogonal lift can be made disjoint from the 
plane that is sent to infinity by shifting in the vertical direction. 


Remark 5.7. By adding a linear function to an orthogonal polyhedral lift we can 
achieve that the exterior face stays in R?. A convex polytope of this kind is called a 
convex cap. An example is given on Figure 11. 
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Remark 5.8. The only self-intersections of the reciprocal diagram from Theorem 5.6 
involve the edges ma,mg, where a is the exterior face of I (and there is no way to 
get rid of all self-intersections unless I’ is the graph of the tetrahedron). The recip- 
rocal diagram can be represented without self-intersections by replacing every edge 
Ma Mg with a ray running from mg in the direction opposite to mg). Complexes of 
this sort are called spider webs in [53]. 

Non-crossing frameworks with non-crossing reciprocals (and thus with some non- 
convex faces) are studied in the article [40]. 


Remark 5.9. The Dirichlet tesselation of a finite point set and the corresponding 
Voronoi diagram are a special case of a framework and a reciprocal diagram of the 
type described in Theorem 5.6. The vertical lift is given by p; = (pj, || pil|7). The 
Voronoi diagram represents the reciprocal in the form of a spider web as described in 
the previous remark. A generalization of Dirichlet tesselations and Voronoi diagrams 
are weighted Delaunay tesselations and power diagrams. One of the definitions of a 
weighted Delaunay tesselation is a tesselation that possesses a vertical lift to a con- 
vex polytope. Thus one can add a fifth equivalent condition to Theorem 5.6: the 
framework is a weighted Delaunay tesselation. For details see [3]. 

In [53] the spider webs were related to planar sections of spatial Delaunay tesse- 
lations. 


Remark 5.10. Not every convex tesselation and even not every triangulation of a con- 
vex polygon has a convex polyhedral lift, see [13, Chapter 7.1] for the “mother of all 
counterexamples.” Those that do are called coherent or regular triangulations (more 
generally, tesselations). There is a generalization to higher dimensions, see [13]. 


5.3. Maxwell—Cremona correspondence in spherical geometry 


Definition 5.11. Let (I, p) be a framework in $? with a planar 3-connected graph I’. 
A weak reciprocal diagram for (I, p) is a framework (I'*,m) in $? such that 


1) for every dual pair (wf, ij) the geodesics p; pj and mame are perpendicular; 


a 


2) for every incident pair (a, 7) the distance between mg and p; is different from > 


A strong reciprocal diagram is defined in the same way except that condition 2) is 
replaced by 


2') for every incident pair (a, i) the distance between mg and p; is less than 3 
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The reciprocity conditions can be rewritten as 


(ma, pi)(mB, Pj) — (Ma, pj)(mB, pi) = 0 (8) 
(Ma, Pi) #0 (9) 
(Ma, Pi) >0 (9’) 


The left hand side in (8) equals (71g x mg, Pi X Pj). 


Definition 5.12. Let (I, p) be a framework in $? with a planar 3-connected graph T° 
and such that for every face a € [2 the points {p; | i € a} are not collinear (that is, 
do not lie on a great circle). A weak polyhedral lift of (T, p) is a map p:T) > R? 
such that 


1) pi = a; p; for every i € To, where a; 4 0; 
2) for every face a € I the points {p; | i € aw} are coplanar; 
3) the planes of the adjacent faces differ from each other. 


A strong polyhedral lift is defined similarly but with a; > 0 in condition 1). 


Theorem 5.13. Let (I, p) be a framework in $* with a planar 3-connected graph 
and such that for every face a € T2 the points {p; | i € a} are not collinear. Then 
the following conditions are equivalent: 


1) the framework has a self-stress that is non-zero on all edges; 
2) the framework has a weak reciprocal diagram; 


3) the framework has a weak polyhedral lift. 


Proof. 1) = > 3) By Lemma 3.7, a self-stress w gives rise to a map A:T; > R 
such that 


S> Az Dy || pi for alli € To. (10) 
JeTo 


Pick an a € [2 and define may € R3 arbitrarily. Define m: Tz > R3 recursively: 
if 7”, is already defined, then for every 6 adjacent to a put 


mp = Ma + dij (pi X pj), 


where (af,ij) is a consistently oriented dual pair. Equation (10) implies that the 
closing condition around every vertex 7 holds: 


> Ai (pi X pj) = 0. 
j 
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Thus we have a well-defined map m: 2 > R? with 
Mpg —Meq || Pi X Pj 


for any dual pair (#B, ij). In particular, 7g —m, 1 p;, which implies that for every 
i there is c; € R such that 

(Ma, Pi) = Ci 
for all w incident to 7. For a generic initial choice of 7g, we have c; € 0 for all 7. If 
we put pj = oe then we have 


(Me, Di) = 1 


for every incident pair (a, i). It follows that for every a € [2 the points {p; | i € a} 
are coplanar and span a plane orthogonal to the vector 7. Due to Ai; 4 0 for every 
edge ij the planes of adjacent faces are different, thus we have constructed a weak 
polyhedral lift of (1, p). 

3) => 2) Let My C R? be the plane containing the points {p; | i € a}. Since 
the points {p; | i € a} are not collinear, the plane My does not pass through the 
origin. Thus it has an equation of the form 


My = {x € RB? | (Mg,x) = 1} 
for some jg € R?. In particular, for any dual pair (wB, ij) we have 
(mg — Ma, Pi) = (MgB — Ma, pj) = 9. 


Hence the vector mg — mq, and with it the plane spanned by mq and mg, is per- 


geodesic mame, is perpendicular to the geodesic p; p;. Since (/Nq, Pi) = 1, we have 


(mq, pi) # 0. Thus (I'*, m) is a weak reciprocal diagram to (T, p). 
2) => 3) Let (I*,m) be a weak reciprocal diagram for (T, p). We construct 
lifts m and p such that 


pendicular to the plane spanned by p; and p;. If we put my = le ie then the 


(Ma, Pi) = 1 (11) 
for every incident pair (a, i). The construction is recursive. 
Pick & € 12 and lift mg, arbitrarily. Due to (9), for every i € wo there is a lift 
pi of p; such that (771q,, pi) = 1. If mq is already defined, and f is adjacent to a, 
then let ij be the edge dual to wf. First determine the lift p; from the condition (11), 
and then determine the lift mg from the same condition with B in place of a. Note 
that if we use p; instead of p;, then the result will be the same: due to the reciprocity 
conditions (8) and (9) we have 


(Ma, Pi) = (Ma, Pj) => (MB, Pi) = (MB, Pj)- 
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This recursive procedure leads to well-defined lifts m and p: going around a ver- 
tex i does not change the value of m, because both the initial and the final values 
satisfy (11). 


3) => 1) Let m:T2 — R? be the map constructed during the proof of the im- 
plication 3) => 2). Asit was shown, for every dual pair (af, ij ) the non-zero vector 
mg — Mq is perpendicular to p; and p;. Thus we have a map A: Ty — R such that 

Mp — Mz = Vij Pi x Pj. 


To determine the sign of A;;, we order the vertices so that the pair (a, ij) is consis- 
tently oriented. Summing around a vertex i of I we obtain 


px > Agny =0. 
JETo 


Hence >> j€D Aij Pi || pi and by Lemma 3.7 the map A gives rise to a non-zero self- 
stress on (IT, p). 


We don’t know what conditions on a framework and the stress guarantee the ex- 
istence of a strong reciprocal diagram. At least it is necessary that the vertices of 
every face are contained in an open hemisphere. The next theorem shows that strong 
reciprocal diagrams correspond to strong polyhedral lifts. 


Theorem 5.14. Let (I, p) be a framework in $? as in Theorem 5.13. Then the fol- 
lowing conditions are equivalent: 


1) the framework has a strong reciprocal diagram; 


2) the framework has a strong polyhedral lift. 


Proof. In the proof of 3) => 2) in Theorem 5.13, note that for a strong lift p the 
equation (7%q, pi) = 1 implies (mq, pi) > 0, so that the reciprocal diagram con- 
structed from a strong lift is strong itself. 

In the proof of 2) => 3) in Theorem 5.13, lift 71g, strongly (that is scale it by a 
positive factor). Condition (11) implies that all p; with? € ap are also lifted strongly. 
The recursion propagates the strong lift to all mg and p;. 


As in the Euclidean case (see the paragraph before Theorem 5.6), a spherical 
framework defines a geodesic extension, that is, a map |I"| — $? that sends every 
edge to an arc of a great circle. A geodesic extension is called a convex embedding 
of I’ if it is an embedding and every face is a convex spherical polygon. 
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Theorem 5.15. Let (I, p) be a framework in §? with a planar 3-connected graph 
T’ and such that its geodesic extension is a convex embedding. Then the following 
conditions are equivalent: 


1) the framework has a Self-stress that is positive on all edges; 


2) the framework has a strong reciprocal diagram that embeds * into $* with 
convex faces; 


3) the framework has a strong lift to a convex polyhedron. 


Proof. 1) => 3) Inthe proof of the corresponding implication in Theorem 5.13 we 
have A;; > 0 for all edges ij. This implies that as we go around a vertex i, the vertices 
Mq for all w adjacent to i form a convex polygon. The union of these polygons is 
the boundary of a convex polyhedron that contains 0 in its interior. Its polar dual is a 
strong lift of (T, p). 


3) = > 2) Aconvex polyhedron that is a strong lift of (I’, p) contains 0 in the in- 
terior. Thus its polar dual is also a convex polyhedron. The projection of the 1-skele- 
ton of the dual is a strong reciprocal diagram with convex faces. 


2) ==> 1) Inastrong reciprocal diagram with convex faces the geodesics mgmz 
and pj; p; that correspond to a dual pair are consistently oriented. When we lift such 
a diagram as in the proof of 2) => 3) = > 1) in Theorem 5.13, we obtain real 
numbers A;; > 0 that provide a positive self-stress on (I, p). 


The latter version of the spherical Maxwell—Cremona correspondence was de- 
scribed in [34]. 


Remark 5.16. As in the Euclidean case, not every convex tesselation of the sphere 
has a convex polyhedral lift. The corresponding theory predates the theory of reg- 
ular triangulations in the Euclidean space and was developed by Shephard [48] and 
McMullen [35]. See also [17]. 


5.4 Maxwell—Cremona correspondence in hyperbolic geometry 


Let (I, p) be a framework in H? with a planar 3-connected graph I. A recipro- 
cal diagram is a framework (I'*,m) in H? such that for every dual pair (a, ij) 
the geodesics mgmg and pj; p; are perpendicular. In terms of the Minkowski scalar 
product this means 


(Ma, Pi)(Mp, Pj) — (Ma, Pj)(mpg, Pi) = 9. 
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Remark 5.17. The above criterion of orthogonality of mamg and pj; p; as well as 
its spherical analog (8) can be reformulated as follows. Diagonals in a spherical or 
hyperbolic quadrilateral with side lengths a, b,c, d in this cyclic order are orthogonal 
if and only if 

cosx a COSx C = COSx cosy d. 


The diagonals of a Euclidean quadrilateral are orthogonal if and only if a” + c? = 
b? + d?. 


Definition 5.18. Let (I, p) be a framework in H? with a planar 3-connected graph 
T and such that for every face a € [> the points {p; | i € a} are not collinear. 
A polyhedral lift of (T, p) is a map p: Tp — R? such that 


1) pi = a; p; for every i € Yo, where a; > 0; 


2) for every face a € I> the points {p; | i € a} are contained in a space-like 
plane; 


3) the planes of the adjacent faces differ from each other. 


Theorem 5.19. Let (I, p) be a framework in H? with a planar 3-connected graph 
T and such that for every face a € 2 the points { p; | i € a} are not collinear. Then 
the following conditions are equivalent: 


1) the framework has a self-stress that is non-zero on all edges; 
2) the framework has a reciprocal diagram; 


3) the framework has a polyhedral lift. 


Proof. 1) => 3) Proceed as in the proof of Theorem 5.13 to construct a map 
im: 2 — R? such that 


Mpg — Ma || Pi X Pj 


(with the Minkowski cross-product) for every dual pair (@B,ij). By changing the 
position of 77, and scaling down the self-stress w we can achieve that all 77, belong 
to the upper half of the light cone. Then the planes (7, x) = —1 bound a polyhedron 
with space-like faces that is a polyhedral lift of (1, p). 


3) = > 2) Similarly to the proof of Theorem 5.13, let (77q, x) = —1 be an equa- 
tion of the plane containing the points {p; | i € a}. Since these planes are space-like, 
Ma are time-like, and since p; belongs to the upper half of the light cone, 771, also 
does. Hence (I'*, m) is a reciprocal diagram in H?. 
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2) => 3) The proof is the same as in Theorem 5.13, we lift (T', p) and (*, m) 
recursively and at the same time. 

3) => 1) Also the same as in Theorem 5.13, but with the Minkowski cross- 
product instead of the Euclidean. 


For a framework (I, p) in H? the geodesic extension || — H7? is an analog 
of the rectilinear extension in the Euclidean case: an edge ij of I’ is mapped to the 
geodesic segment p; p;. If the geodesic extension is an embedding, then we define 
the interior and exterior faces and the interior and boundary edges as in the Euclidean 
case, see the paragraph before Theorem 5.6. 

For a consistently oriented dual pair (af, ij) we say that the lines p; p; and mymg 
are consistently oriented if the directed line magmg is obtained from the directed line 
Pi p; through rotation by 5 around their intersection point. 


Theorem 5.20. Let (I, p) be a framework in H? with a planar 3-connected graph 
T and such that the geodesic extension of p to || provides an embedding of T into 
H? with convex faces. Then the following conditions are equivalent: 


1) the framework has a self-stress that is positive on all interior edges and negative 
on all boundary edges; 


2) the framework has a reciprocal diagram such that for every consistently ori- 
ented dual pair (a, ij) the lines p; p; and mymg are consistently oriented if 
ij is an interior edge and non-consistently oriented if ij is a boundary edge; 


3) the framework has a polyhedral lift to a convex polytope in the Minkowski 
space. 


Proof. The proof consists in checking that the constructions in the proof of Theo- 
rem 5.19 respect the above properties. 


Remark 5.21. A variant of the Maxwell—Cremona theorem for hyperbolic frame- 
works uses an orthogonal polyhedral lift to the co-Minkowski space instead of a 
radial polyhedral lift to the Minkowski space described above. For details on the 
co-Minkowski space see [16]. 


Remark 5.22. If we allow the faces of the polyhedral lift to be time-like or light-like, 
then the vertices of the corresponding reciprocal diagram become de Sitter or ideal. 
Since the reciprocity is a symmetric notion, it is natural to allow de Sitter and ideal 
positions for the vertices of the framework as well. This puts us into the more general 
context of hyperbolic-de Sitter frameworks or point-line-horocycle frameworks, see 
Section 2.3. 
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PART Il 
Projective geometries 


Chapter 13 


Contributions to non-Euclidean geometry I* 


Eduard Study 


The author’s intention is to edit a series of articles under a common title concern- 
ing algebraic and differential geometry in non-Euclidean space, a subject that he has 
been investigating for many years although not yet arriving at any other than sketchy 
publications on the topic. 

In the first of the present three contributions,! a seemingly new property of the 
straight line will be discussed. The second article is independent from the first. It is 
entitled “the notions of left and right,’ but it also treats other fundamental notions 
of non-Euclidean geometry. It is meant to be an introduction to further articles, and 
therefore it is kept elementary. It may perhaps also serve as an introduction to the 
methods that the author used in his book Geometrie der Dynamen (Leipzig, 1903). 
The third note contains a simple application to helical surfaces in non-Euclidean 
space of positive curvature that are considered as geodesics in a four-dimensional 
manifold. 


I. Straight line and point as extrema 


It seems not to have been noticed so far that the theorem of the straight line as the 
shortest path between two points has two analoga in hyperbolic geometry, where the 
point remarkably does not appear as the shortest but as the longest path between two 
lines. 

If we extend the point continuum of Lobatchevsky’s geometry to projective ge- 
ometry by adding the infinitely far, or so-called ideal points in the well-known way, 
then one can also state this claim that obviously needs to be made more precise, in 
correlative form: the straight line or rather a piece of the straight line appears as the 
longest path between two points. 


* “Beitrage zur Nicht-Euklidischen Geometrie. I.”, Amer. J. Math. 29 (1907), no. 2, 101-116. Trans- 
lated by Annette A’Campo-Neuen. 

' [Editor’s note] In the present volume, only the first contribution is translated, whose subtitle is 
Gerade und Punkt als Extreme, translated, as below, by Straight line and point as extrema. 

2 We plan to show elsewhere that a part of the concept of the notions of left and right does not need 
the notion of measurement but that it can be considered in the real projective geometry of space. 
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We first consider the plane geometry of constant curvature —1, associated to the 
absolute polar system 


(xy) = x11 — X22 — X33 = 0. (1) 


Then we can uniquely define the distance (that is here always considered as positive 
> 0) between two points in the interior {(xx) > 0, (vy) > 0} of the absolute cone? 


by the formula 
2 | (xy)? 
(x, y) = arcecosh Gaon: (2) 


where we choose the positive value for the square root and the function arccosh. The 
same formula provides a real “distance” for the points x, y, even in the case when 
they lie outside of the absolute cone such that the straight line joining them meets the 
absolute cone. Moreover, also in this case for three points y, x, z on a straight line, 
where the one in the middle x, separates the other two, the following equation holds: 


(y,2) = (y, x) + (x, 2). (3) 


Now consider a triangle x, y, z of distinct points in the exterior such that each of 
the lines joining two of the points meets the absolute cone. Hence 


(yz)? — (vy)(zz) > 0, (xx) <0, 
(zx) — (zz)(xx)=0, (vy) <0, (4) 
(xy)? —(xx)(yy)>0, (zz) <0. 


Then one can distinguish two families of such triangles. If 


(yz)(zZx)(xy) < 0, (Sa) 


then the longest side of the triangle x, y, z is always greater than the sum of the other 
two, or at least equal to their sum. And equality holds if and only if the three points 
are collinear. 

If however 


(yz)(zx)(xy) > 0, (Sb) 


then the longest side may be greater or smaller than the sum of the other two, or it 
may be equal to their sum. But in this case the three points are never collinear. 
3 [Translator’s note] The author uses the term “absoluter Kegelschnitt," i.e. absolute conic section, 


for the zero set of the bilinear form. Indeed the lines through zero on the cone form a conic section in 
the projective plane. 
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Let us now assume that 


222 sx), (ys 2) = (2), (6) 


and ask under what assumptions we can conclude that 
(y,2) = (y,x) + @, 2). (7a) 
Obviously, we can rewrite inequality (7a) in this form: 
cosh((y, Zz) — (y, x)) = cosh(x, z). 


After solving by using the addition theorem for the hyperbolic cosine on the left-hand 
side, the inequality takes the following form, assuming positive values wherever a 
square root is taken: 


V (yz)? V (xy)? + (vy) v (xz)? = V (yz)? — (vy)(zz) V (xy)? — (xx) (yy). 


Here, by the assumptions (4) and (6), both sides are positive. Hence the inequality is 
equivalent to the one that we obtain by taking squares: 


(xx)(yy)(2z) — 2 (yz)? (2x)? (xy)? — (xx)(yz)? — (vy) (2x)? — (zz) (xy)? = 0. 
On the other hand, 


(xx)(yy)(2z) + 2(yz)(2x)(xy) — (xx) (yz)? — (vy) (2x)? — (2z)xy)? = 0, 


since the left-hand side is equal to the square of the determinant (xyz). Therefore, if 
the inequality (5a) holds, then the last two inequalities coincide and (7a) holds. They 
are both equalities if (xyz) = 0. However, we cannot conclude that conversely (7a) 
implies (5a). 
Now let us assume that (5b) holds. Then we claim that any of these relations are 
possible 
(y.2) 5 (yx) + @2), (7b) 


or as above (assuming (6) analogously) the following may occur 


(xyz)? — 4(yz)(zx)(xy) 2 0. 


In fact, one can verify this easily by examples or by the argument given below. In this 
case (xyz) never vanishes because then in (7b) we would have equality. Therefore 
the product (yz)(zx)(xy) would also vanish. But this is impossible by (4). 

In order to obtain a clear idea of the corresponding triangular figures we want to 
assume that the points y, z are given. It is easy to see that then the set of all points x 
satisfying the inequalities (5a) or (5b) is represented by the shaded regions in Figure 1. 
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Vertical shading indicates that 


(yz)(Zx)(xy) > 0, 


and horizontal shading indicates that 
(yz)(zx)(xy) < 0. 
Moreover, Figure 2 indicates with vertical and horizontal shading where 
(y,z)—(y,x)—(%,z) 20, or (y,z)—(y,x)—(,2) 0. 
The figure also shows with fat lines the locus of all points x where 
(yz) —(y, x) — (@,z) = 0. 


This locus consists of three completely separate curves. One of these curves is 
linear whereas the other two are parts of the curve of order 2 


(yzx)? — 4(yz)(yx)(zx) = 0. (8) 
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Firstly, this curve touches the polar lines of the points y, z, that is the lines (yx) = 0 
and (zx) = 0, at their intersection points with (yzx) = 0. Now if we write down its 
equation in line coordinates using the notation 


(UX) = XyUy + U2QX2 +U3X3, (UV) = UyVy — UQV2 — 4303, 


then after splitting off the non-vanishing factor 4(yz){(yz)* — (yy)(zz)} from the 
quadratic covariant of the expression (8), one finds the equation 


(yz)(uu) — (uy)(uz) = 0. (9) 


Fig. 2. 


Hence one can see that the curve (8) touches the minimal lines through y, z, i.e. the 
tangents to the absolute cone going through y, z, of course in real points. Finally, 
through the two touching points of the curve (8) with the minimal lines through z 
there is the locus of (y, x) = (y, Z) which is not drawn in Figure 2, but could easily 
be added. The regions within the curves (y, x) = (y,Z) and (z, x) = (y, Z) belong 
to the inequalities (y,z) < (y,x) and (y,z) < (z,x), and are therefore horizon- 
tally shaded. The curve (8) however, separates the two kinds of shading wherever it 
lies outside of these regions. Now it is easy to see that the shadings are carried out 
correctly by letting x run through the boundary curves of the different regions. 

For greater clarity, we add here a third figure—in collinear modification—show- 
ing for a given x, where the points y and z have to be if we require simultaneously 


(yz)(zx)\(xy) <0 and (y,z) > (y,x) + @,2Z). (10) 
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Fie. 3. 


In this case, which will now be investigated further, we will say: “The point x 
separates the points y and z,” or “The point x lies between y and z.” If there is no 
minimal line among the lines joining the points x, y, z then the polar lines of the 
points x, y, z are in the relative position indicated in Figure 4a, whereas the triangles 
of the second family correspond to Figure 4b. 


Fia, 4a. Fie. 4b. 


In order to avoid lengthy arguments, for the following theorems we refer to intuition 
(see Figure 4a), or an obvious continuity consideration: 


If the point x' lies between y and 2, and the point x" lies between (y and x' or) 
x’ and z, then x" also lies between y and z. 

If two different points lie between y and z in such a way that the straight line join- 
ing them meets the absolute cone then all four points have a certain order y, x', x", z 
orz,x",x', y. More precisely, after suitable choice of notation, x' lies between y and 
x", and x" lies between x' and z, but not x" between y and x', and not x’ between 
x” and z. 

If, moreover, the point x lies between two such points x' and x", then x also lies 
between y and z.* 


4 All this could be proved rigorously with purely algebraic means but as far as we can see the proof 
would not really be short. Unless it concerns fundamental theorems of deep theories, we may well permit 
ourselves to shorten the presentation if it is possible to fill the gaps easily if needed. 
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The two drawings in Figure 5, may sufficiently illustrate how those theorems are 
derived. 


Now it is easy to conclude the following remarkable theorem: 


Suppose that there is a given set of points outside of the absolute cone, forming 
a continuum, and such that each straight line joining any two different points of the 
set meets the absolute cone. Then every point of the set with at most two exceptions 
yo, Zo, has the following property: If this point x is removed from the set then the 
remaining points of the set form two again continuous subsets (y) and (z) such that 
x separates all the points y of the first set from all the points z of the second set. 

All the points of the set lying between two different points y, z form together 
with y, z a closed, continuous curve whose points x can be assigned bijectively and 
continuously the values of a parameter t between 0 and 1. 

This curve has a well-determined length S' satisfying the inequalities 


0<S <(y,z). 


Moreover, if there are two exceptional points yo, Zo as mentioned above, then 
the entire set consists of these two points and a chain of curves joining them with a 
certain arc length and whose points different from yo, Zo all lie between yo and Zo. 
The points y, z referred to in the theorem always preserve the property of such limit 
points yo, Zo if one removes all those points of the set different from y, z that do not 
lie between y and z. 
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Fig. 6. 


Such a chain of curves as described in the theorem will be called a path between y 
and z. Now we have the result announced in the beginning: 


If y and z are two different points outside of the absolute cone, whose joining 
straight line meets the absolute cone, then the straight line is the longest among all 
paths between y and z. 


Note these further useful remarks: If the straight line joining the points y, z is a 
minimal line, i.e. if it touches the absolute cone, then this straight line is the unique 
path between y and z. In every other case, there are infinitely many paths between y 
and z, and arbitrarily close to any such path there are infinitely many other paths of 
length zero. 

Figure 6 illustrates how to construct a path that is arbitrarily close to the longest 
path, but itself is of length zero. 


For the points of a straight line not intersecting the absolute cone, i.e. a line lying 
entirely in the ideal domain, we can define an elliptic measurement.*> The distance 
of two points x, y on such a line will then only be determined up to sign and up 
to a multiple of a certain period, that we can assume to be 2. Among the different 
values of this function, we choose the one with smallest absolute value, consider it 
as a positive value and denote it by (x, y). If moreover, we choose for all occurring 
roots their positive values, we obtain the equation 


(xy)? 
(xx)(yy)’ 


cos(x, y) = 


(1) 


and we always have 
0 <cos(x,y) <1. 


5 [Translator’s note] The author uses the term “Massbestimmung." It is not a metric however, since 
for the points in the exterior the triangle inequality does not always hold. 
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We can think of the same formula (11) as being true in the limiting case where 
the straight line containing x, y touches the absolute cone, but neither x nor y is the 
point of contact. We then set (x, y) = 0. 

Now we want to show that the hereby defined distance (x, y) as well as its sup- 
plement 


[x,y] =a-(,y) 


also has the property of being a maximum among all the lengths of certain “paths” 
between the two points, that still have to be defined. 

Again we consider a triangle x, y,z of different ideal points but now such that 
the straight lines joining any two points shall not meet or at most touch the absolute 
cone: 


(yz)? — (vy)(zz) <0, (xx) <0, 
(zx)? — (zz)(xx) <0, (yy) <0, (12) 
(xy)? —(xx)(yy) <0, (zz) <0. 


Without loss of generality, we may assume that 


20.272 0,2) 2 (2) 20. (13) 
The inequalities that we will have to assume consecutively 
(yz) 2 (yx) + (2) (14) 
may then be written in the following form: 
cos(x, Z) — cos(y, Z) cos(y, x) = sin(y, Z)- sin(y, x). 


Since here by assumption both sides are positive values, taking squares leads to equiv- 
alent inequalities 


(xx)(yy)(@2) — 2V 2)? Ex)? ey)? — (ax)(y2)? — (vy) (ex)? = (22)ay)? 2 0. 
As before one can see that here only > or equality is possible if 
(yz)(Ex)(xy) <0, (15) 


whereas in the case 
(yz)(zx)(xy) > 0 (16) 


one cannot draw a conclusion. 
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Fie, 7. 


It will be sufficient if moreover we first exclude the easily treated limiting cases 


T 
V2)=5> [y.z] and (y,z)=0,[y,zZ]=7 
and hence assume that 
0 <(y,z) < > <[y,z] <2. 


Each of the two segments into which the straight line yz is decomposed by the two 
points y,z is now a diagonal of a simple quadrangle whose sides are pieces of minimal 
lines (compare Figure 7). 

In the interior or on the boundary of these two quadrangles are all the points x 
satisfying the inequalities (12). Apart from that both quadrangles have different prop- 
erties. By integrating the arc element along a straight line between y and z through 
the interior of one quadrangle one obtains the smaller value (y, z), and by integrating 
along a straight line through the other quadrangle one obtains the larger value [y, z]. 
The first “smaller” quadrangle as we call it for simplicity, is completely contained 
in the domain (yz)(zx)(xy) < 0. The other “larger” quadrangle is decomposed 
by the absolute polar lines of the points y, z into three parts, two outer parts where 
also (yz)(zx)(xy) < 0, and one inner part, where (yz)(zx)(xy) > 0. Moreover, 
analogously to above one finds that the locus of all points x with 


(y,2) = (y, x) + (%, 2) (17) 
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consists of three separate curves. The first one of these is linear and lies entirely in 
the smaller quadrangle. The other two lie on the irreducible curve of degree 2 


(yzx)? — 4(yz)(zx)(xy) = 0 (18) 


and are contained entirely in the larger quadrangle. Thus one finally obtains insight 
into the distribution of the regions of x where 


(y, 2) — (y.%) — (&%, 2) = 0 


holds and the other ones with 


(y,z)— (vy, x) — (x, z) <0. 


In Figure 7, the first ones are indicated by vertical shading and the second ones or 
more precisely the one second domain is indicated by horizontal shading. 

Now as above for the hyperbolic measurement, we obtain a notion of path be- 
tween y and z lying in the smaller quadrangle, and we find the theorem that among 
all such paths the straight line is the longest. We further claim that in the larger quad- 
rangle the same relations hold. That is, we claim that in the triangle x, y, z the length 
of the side yz is not less than the sum of the lengths of the other two sides if x is 
contained in the larger quadrangle and if all three lengths are defined via integration 
of the arc element along a straight line such that the path of integration does not leave 
the larger quadrangle. 

One can see without difficulty that for the proof of this claim one has to distinguish 
three cases. 


1) The point x lies in the middle part of the larger quadrangle (or on its boundary). 
Then (yz)(yx)(zx) > 0. The three lengths are [y, z], (vy, x) and (x, z). The 
claim is 


Ly, Z] eas (y, x) am (%; 2). 
2) and 3) The point x lies in one of the outer parts, e.g. in the part cut off by the 
polar line of the point y, or such that (y, x) > (x, z). Then (yz)(yx)(zx) < 0, 


the three lengths are [y, z], [y, x], (x, z), and the claim is 


[y.z]>[y,x]+(@,z) or (y,x) > (y,2z) + ©, 4). 


In all these cases the proof can be carried out in the same manner as above. 
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Now we will state the result in a shorter form than previously and we will essen- 
tially only point out what is different from the previous theorem. 

We call the used kind of measurement pseudo-elliptic since it coincides with a 
usual elliptic one in constant positive curvature (+1), but differs substantially from 
it by the existence of paths of length zero. Using this terminology we can then say: 


If y, z are two different points whose straight line joining them does not inter- 
sect the absolute cone, then there are two families of direct paths between y and z. 
Namely, there are two simple quadrangles bounded by minimal lines having y and z 
as opposite vertices. In the interior or on the boundary of each of these quadrangles 
there are the paths of the first or the second family respectively. Among all paths of 
the same family, using pseudo-elliptic measurement, the straight line appears as the 
longest. 


Each direct path is a piece of a curve of a certain arc length whose secants either 
do not meet or at most touch the absolute cone. By any of its points x different from 
y and z, it is decomposed into two paths with the same property. If in the limiting 
case that we first excluded, y and z remain ideal points such that the straight line yz 
touches the cone then there is exactly one “proper” and one “improper” direct path 
between y and z. Both are linear. The first one lies entirely outside of the absolute 
cone; it has length zero. The length of the second path is not properly defined, but 
maybe set = zr. 

A piece of a curve can have the property that around each of its points one can 
draw a region where the piece of the curve is a direct path between any two of its 
points y’, z’, even though it may not have this property as a whole. For such pieces of 
curves, joining two given points y, Z, the last theorem does not hold. On the contrary, 
one can obviously join two arbitrary ideal points—even those whose joining straight 
line intersects the absolute cone in separate points—by such curves whose length 
exceeds an arbitrary given number. The length of the direct path between the points 
y, Z, if such a path exists, is hence in a much more restricted sense a maximum than 
the path length considered with respect to hyperbolic measurement. 


Since the introduction of the so-called ideal points in Lobatchevsky’s geometry 
is not really necessary, one might prefer a different formulation of the presented the- 
orems avoiding that notion. Of course the translation from one language into the 
other can be carried out easily. For example the first result that we discussed can be 
rephrased as follows: 
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If among three straight lines y, x, z, that are pairwise either parallel or ultra- 
parallel, the middle one separates the other two, then the length of the common nor- 
mal between y and z is greater or equal to the sum of the lengths of the normals 
between y, x and between x, 2. 


Equality occurs if all three lines have a common perpendicular, or in the limiting 
case, if they are all parallel. If y, z are ultraparallel, the “length of the common normal 
between y and z” means the length of the shortest joining line between a point of y 
and a point of z, and in the limiting case of being parallel, length zero. Everything 
else follows immediately: The ideal point of intersection of two ultraparallel lines y, 
z, or its polar line, the common perpendicular of y, z, considered as the locus of the 
lines perpendicular to it, now appears as the longest path between these two lines. 

Similarly, one can translate our second theorem with its corollaries. The funda- 
mental fact in this case turns out not to be different from the well-known theorem on 
angles of a triangle, that can be expressed as follows: 


In each ordinary triangle one of the outer angles is greater or equal to the sum of 
the two inner angles that occur at the side opposite to the outer angle. And equality 
holds if and only if the three sides go through one common point. 


It is less intuitive to work with lines and their angles rather than with points and 
their distances. In the first case one does not have as in the second case a guiding prin- 
ciple in the analogous situation in Euclidean geometry. And that is the reason why 
up to now, one has not given this well-known result the turn on to which we focus 
here (if indeed the matter might be new). But if we present the straight line here as a 
however relative maximum, then this is certainly a fact that should not be missing in a 
complete system of non-Euclidean geometry. More generally, it may well be permit- 
ted to mention that as it seems to us, the development of non-Euclidean geometry is 
still very incomplete even in very elementary aspects. For example Lobatchevsky’s 
trigonometry is certainly a very important part but always only a part: triangles with 
all or some ultraparallel sides, in particular the hexagons with only right angles cor- 
responding to the first kind, have also to be considered in a system that claims some 
kind of completeness. The theory available at present still lacks finer perfection, the 
investigation of the corresponding groups, the connection to invariant theory etc., 
even though the tools for such investigations have long since been provided for.® 


© The methods and formulae available in the literature are completely sufficient for those purposes. 
See S. W. Jacobsthal, article “Trigonometrie” in Encyclopddie der elementaren Geometrie (Leipzig, 1905), 
and E. Study, “Spharische Trigonometrie” etc. (Abh. d. Sdichs., G. D. W. Leipzig, 1893), and E. Study, 
Geometrie der Dynamen, Leipzig 1903, pp. 209-212. 
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Now asking for an extension of the deduced results to arbitrary dimension, we 
find a remarkable difference among the sketched little theories. 


Our first theorem on the straight line as longest path can be generalized to an 
arbitrary dimension, the second theorem only holds for the plane. 


Namely, let the absolute polar system be given by 


(xy) = X11 — X2Y2 —++* — Ann = 0. 


Now first consider two points y, z outside of the absolute cone (xx) = 0 such that 
the straight line joining them meets the cone. In every plane through the line yz, 
we obtain the triangle figures that we have already investigated. So for hyperbolic 
measurement, instead of the quadrangle that we discussed above, we obtain a piece 
of the space of dimension (n — 1), bounded by pieces of two conoids emanating from 
y and z 


(yx) — (vy)(xx) =0, (zx)? — (zz)(ex) = 0. 


Now “paths” in this piece of space have to be defined basically as before, but they do 
not have to lie in planes. The straight path is the longest among all such paths. 

However, if the line yz does not meet the absolute cone then there are three differ- 
ent kinds of planar paths between y, z. Firstly, a plane through y, z might permeate 
the absolute cone in a real curve. For pseudo-elliptic measurement, (y, z) or [y, z] ap- 
pears as the maximum among all suitable piecewise linear paths. Secondly, the plane 
might touch the absolute cone. It is easy to see that in this case there are infinitely 
many equally long paths between both points. Finally, the plane may not intersect 
the absolute cone at all. In this case (y, Zz) is a minimum. Hence the length of the 
straight linear path in this case is not extremal. 

But this obviously does not exclude the possibility that on a manifold of lower di- 
mension with such a measurement, the problem of shortest (or longest) lines may 
get a clear content. For example in the eight-dimensional space, there is a four- 
dimensional manifold M4 with the following strange property for suitably chosen 
pseudo-elliptic measurement: 


The length of the shortest path measured on Mg itself between two points y, z in 
Mz equals the length of the straight linear path between these two points.” 


We only want to sketch other generalizations of the matters discussed. Similar 
extremal properties as the ones considered here, can be found for geodesics belonging 
to an arc element whose square is defined by a quadratic differential form in the case 


7 E. Study, “Ktirzeste Wege im komplexen Gebiet,” Math. Ann. 60 (1905), p. 361. 
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that this form for line elements yields a hyperbolic measurement. Moreover, in some 
parts of our previous considerations, one can replace the cone by an arbitrary oval. 
Somewhat related to these ideas is for example the following easily proved but maybe 
still not completely uninteresting theorem: 


Consider a set of points in the finite part of the Euclidean plane forming a closed 
continuum and having the property that all secants between different points of the set 
have an angle with a given straight line satisfying an inequality of the form 


| tan(y)| < M. 


Then this point set is a rectifiable curve joining certain points y, z. If D is the distance 
of the projection of the points y, z to the given line, then the arc length of this curve 


is at most equal to 
|DV1+ M?|. 
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‘*‘Contributions to non-Euclidean geometry I’ 
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The paper “Contributions to non-Euclidean geometry I’ [10] by Eduard Study, pub- 
lished in 1907, contains one of the first attempts to study the geometry of the exterior 
of the hyperbolic plane. In particular, the author defines a distance between points 
in the exterior of the disc model of this space sitting in the projective plane, and for 
triples of points in this exterior space, he discusses at length the various cases where 
the triangle inequality is satisfied and those where a reverse triangle inequality is sat- 
isfied. The setting is that of projective geometry. In the present note, we provide an 
outline of Study’s paper, together with some historical and mathematical comments. 


1 An outline of Study’s paper 
1.1 A model of the hyperbolic plane 


In the three-dimensional vector space IR? equipped with a basis in which points are 
represented by coordinates as x = (x1, X2, 3), the author starts with the quadratic 
form of index two defined by 


(x, x) = x? — x2 — x2. (1) 
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To this quadratic form is associated an absolute cone (the isotropic cone of the 
quadratic form) whose equation is 


(x,x) = 0. 
The interior of the absolute cone is the set of points x satisfying 
(x,x) > 0. 


A distance function (x, y) is defined on the interior of the absolute cone by the for- 


mula 
_ | (xy)? 
(x, y) = arccosh Enon. (2) 


Considering (x1, x2, x3) as the homogeneous coordinates of a point x in the 2-di- 
mensional real projective space and using the homogeneity of formula (2), the author 
gets a distance function on the quotient of the interior of the absolute cone by the 
positive reals, and this distance turns out to be a model of the hyperbolic plane. The 
goal of the paper is to extend this distance function to the points that are outside of 
this hyperbolic plane. 


1.2 The exterior of the hyperbolic plane 


The same formula (2) is used to define a distance function on the exterior of the abso- 
lute cone in R?. In the quotient projective plane, this will provide a distance function 
on the exterior of the hyperbolic plane, making this space a Lorentzian surface which 
is not Riemannian. The distance function is first defined on some pairs of points 
(x, y), namely, those for which the (Euclidean) straight line joining them meets the 
absolute cone. 

This distance function, as in the case of the above model of the hyperbolic plane, 
is “projective” in the sense that it satisfies the property that if three points x, y, z lie 
in that order on a Euclidean segment, we have 


(x, 2) = (x,y) + (, 2). 


In other words, Euclidean lines are geodesics for this distance function. For points 
x,y, Z Satisfying this equation, y is said to lie between x and z. 

Study considers triangles in this Lorentzian surface, that is, triples of points such 
that for each pair among them, the Euclidean line joining them meets the absolute 
cone. Certain triples of points satisfy the triangle inequality, whereas others satisfy 
the reverse triangle inequality. More precisely, Study proves the following: 
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Proposition 1.1. For a triangle x, y,z in the exterior hyperbolic geometry, 


(1) if (xy)(yz)(zx) < 0 then the longest side of the triangle is at least equal to 
the sum of the other two; 


(2) if (xy)(yz)(zx) > 0 then the longest side of the triangle may be greater, equal 
or smaller than the sum of the other two. Furthermore, in this case, the three 
points can never be collinear. 


We note that distance functions satisfying the reverse triangle inequality are stud- 
ied in the context of the geometrical setting of relativity theory, and the reverse trian- 
gle inequality is called in this setting the time inequality, cf. Busemann’s paper [2]. 

Study provides detailed descriptions of all the situations given in Proposition 1.1 
(Figures 1, 2, and 3 of his paper). 


1.3 The space of lines in the hyperbolic plane 


There is a polarity theory between the hyperbolic plane and its exterior where the 
points in the exterior represent lines in the hyperbolic plane, and vice versa, the points 
in hyperbolic plane represent lines in the exterior of that plane. Using this polarity, 
Study interprets the results stated above in terms of distances between lines in the 
hyperbolic space, and he provides pictorial representations (Figures 4 and 5 of his 


paper). 
1.4 Geodesics 


Study proves the following theorem which he describes as being “remarkable,” 
regarding geodesics for the distance function defined outside the absolute cone. The 
theorem has two parts: 


Theorem 1.2. The following holds. 


(1) Consider a set of points outside the absolute cone forming a continuum! and 
such that each straight line joining any two points of the set meets the absolute 
cone. Then every point x of the set, with at most two exceptions Xo and yo, 
has the following property: the complement of the point x in the set is a union 
of two continua such that x separates all points y from the first set and points 
z from the second set. 

' The terminology is not uniform. In the classical terminology, a continuum can be a closed con- 


nected subset of a topological space, or a compact connected metric space. In the present situation the 
two terminologies coincide. (There are other notions of continua.) 
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(2) Within the set of points described in (1), the set of points that lie between 
any two distinct points x and y form, together with x and y a continuous 
curve homeomorphic to the segment [0, 1] whose length is at most equal to the 
distance between x and y. 


A curve such as the one described in (2) of the proposition is called a path between 
x and y. The fact that its length is less than or equal to the distance between x and 
y follows from the observation that the entire path must lie within the region where 
the reverse triangle inequality holds. 

Study provides a description of the exceptional set of points xo, yo mentioned in 
the first part of Theorem 1.2. 


A second theorem is stated: 


Theorem 1.3. [f x and y are two distinct points outside the absolute cone such that 
the straight line joining them intersects the absolute cone, then this straight line is 
the longest among all the paths joining x and y. 


This gives an explanation of the following sentence at the beginning of the paper: 
“Tt seems not to have been noticed so far that the theorem of the straight line as the 
shortest path between two points has two analoga in hyperbolic geometry, where the 
point remarkably does not appear as the shortest but as the longest path between two 
lines.” 

A remark is made for the case where the straight line joining x and y is tangent 
to the absolute cone. In this case, this straight line is the unique path joining x and y. 
In all the other cases, there are infinitely many paths between these two points and 
arbitrarily close to each such path, there are infinitely many others whose lengths are 
zero. 


1.5 A measure on a line not intersecting the absolute cone 


Study then introduces a distance (x, y) between pairs of points (x, y) such that the 
line that joins them does not intersect the absolute cone. The distance is defined up 
to a sign and up to a period. Such a distance function is termed a “pseudo-elliptic 
measure.” Taking the sign for the distance between two points to be positive, the 
measure is given by the formula: 


af hae 
cos(x, y) = Gao (3) 
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In the limiting case where the given line touches the absolute cone, with neither 
x nor y being the point of contact, one sets, by continuity, (x, y) = 0. 

This distance function between two points is equal to the length of the line seg- 
ment joining them and its value is a maximum among all lengths of a certain col- 
lection of paths which Study defines between two points and which he studied in a 
certain detail. 

The term “pseudo-elliptic” measure used for this distance function is chosen be- 
cause of the analogy with the distance function on the sphere. In fact, taking as 
(xy) the Euclidean inner product in R? instead of the one associated to the quadratic 
form (1), formula (3) is a consequence of the usual cosine formula in R? applied to 
the triangle Oxy where O is the origin of the coordinates. It also gives the formula 
for the cosine of the distance between the points x and y on the sphere. (Recall that 
in the usual model of the sphere, centered at the origin of IR, the distance between 
two points is defined as the angle between the rays starting at the origin and passing 
through these two points.) 

At the end of the paper, Study discusses higher dimensional generalizations. He 
declares that Theorem 1.2 admits a generalization whereas Theorem 1.3 is essentially 
planar. 

He also declares that non-Euclidean geometry has to be further developed, for 
instance, completing Lobachevsky’s formulae by including triangles or right angled 
hexagons with some edges being ultraparallel (that is, having some vertices at infin- 
ity), including also group theory in this domain, making connections with invariant 
theory, etc. 


2 Notes on Study’s paper 


Study uses the terms “distance” and “measure” between points. Both terms were 
used by Klein in his fundamental paper [7] in which he defines distance functions 
associated to conics that give rise to non-Euclidean metrics (see also [1]). One should 
recall here that the terminology and the axioms for a metric space as we understand 
them today were still not standardized, and we cannot expect Study to talk about a 
“metric space” in this sense. As a matter of fact, the axioms for a metric space were 
formulated by Fréchet in his 1906 paper [5] (which was also his doctoral dissertation), 
and by the time Study wrote his paper, this set of axioms was still far from being 
universally adopted as a definition of a metric. 

In conclusion, there was no way in which Study could discuss the question of 
whether his distances and measures give rise to genuine metric spaces in the sense 
we intend them today. 
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In his paper “A sixth memoir upon quantics” [3], Cayley defined a distance on 
the sphere by the formula using cosines in Study’s paper (formula (3) above). Cay- 
ley’s context, like Study’s, is that of projective geometry, but with an absolute cone 
associated to an imaginary conic and given by the equation 


2 2 3 


In his paper [7] (Section 11), Klein used Cayley’s formula to define a distance he 
called a “measure.” He then transformed this formula to obtain the known distance 
formula in terms of the cross-ratio. 

Cayley later expressed his satisfaction regarding Klein’s formula. In his com- 
ments on his paper [3] that he included in Volume II of his Collected mathematical 
papers ([4], p. 604), he writes: 


In his first paper, Klein substitutes, for my cos~! expression for the dis- 
tance between two points, a logarithmic one; viz. in linear geometry if 
the two fixed points are A, B then the assumed definition for the distance 
of any two points P, Q is 

AP.BO 


dist.(PQ) = slog 40 BP’ 


this is a great improvement, for we at once see that the fundamental rela- 
tion, 
dist.(PQ) + dist.(QR) = dist.(PR), 


is satisfied. 


The formulae to which Cayley refers are contained in his paper [3] p. 584. 


Measuring distances of points in the exterior of the hyperbolic space, with the idea 
that these points represent hyperplanes of the space itself, and making trigonometric 
formulae for this exterior geometry is now a well established topic. It is addressed for 
instance in §2.6 of Thurston’s notes on the geometry and topology of 3-manifolds, 
see [14]. 

The reader will have noticed that Study declares in his introduction that the re- 
sults in his article are part of a subject he has been investigating for many years 
“although not yet arriving at anything else than sketchy publications on the topic.” 
It was not unusual, among nineteenth century German mathematicians, to publish pa- 
pers in which they give the important ideas without the details of proofs. Sometimes, 
theorems were stated without the precise hypotheses that describe the conditions of 
their validity. Klein’s “Ueber die sogenannte Nicht-Euklidische Geometrie” [7] is an 
example of a paper written in this style. 
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It would be useful to rewrite Study’s paper with more details, making explicit the 
relation with the de Sitter metric and other works, in particular Poincaré’s quadratic 
geometries ([8] and the prologue of the present volume) and the distance functions 
related to the geometric theory of relativity, e.g. Busemann’s timelike metrics [2]. 


3 A note on Eduard Study 


Christian Hugo Eduard Study (1862-1930) was mainly interested in invariant theory. 
He had a very personal approach to mathematics. Despite the fact that he studied 
at university (first, biology, for one year, in Jena, then mathematics in Strasbourg, 
Leipzig, and Munich), he considered himself as self-taught. Apparently, he did not 
like to listen to lectures but he used to read lecture notes and rethink the material 
in his own way (cf. [6]). In a letter to Klein, talking about Christoffel’s lectures 
in Strasbourg, Study writes: “All these I can obtain in the form of good notes that 
I actually prefer to the lectures themselves. Because here I am free and independent, 
I can study whenever I want, I can skip well-known material or make my own selec- 
tion and thereby proceed much faster than if I would have to sit still on the seat during 
one semester” (our translation from [6], p.50). On the other hand, it appears from 
the biography in [6] that Study appreciated the inspiration he got from his teachers, 
in particular Klein, and admired Lie and Grassmann. 

In 1885, Study obtained a position at the University of Leipzig where Klein and 
Hilbert were working, and, starting from 1894, he taught at the University of Bonn. 
It is believed that Study had a crucial influence on Felix Hausdorff, whose career 
was stagnant, partly because of his extreme modesty, encouraging him and leading 
him towards important research problems. Among Study’s subjects of interest, we 
mention spherical trigonometry; cf. his paper [11] and his book [12]. He interacted 
and corresponded with Lie, Engel, Gordan, Severi and several other mathematicians, 
but he remained a solitary person. 

In a letter to Hurwitz, quoted by Constance Reid in her book [9] (p. 25), Hilbert 
writes: “[Study] is a strange person, and almost completely at opposite poles from 
my nature and, as I think I can judge, from yours too. Dr. Study approves, or rather 
he knows, only one field of mathematics and that’s the theory of invariants, very ex- 
clusively the symbolic theory of invariants. Everything else is unmethodical ‘fooling 
around’ ...He condemns for this reason all other mathematicians; even in his own 
field he considers himself to be the only authority, at times attacking all the other 
mathematicians of the symbolic theory of invariants in the most aggressive fashion. 
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He is one who condemns everything he doesn’t know whereas, for example, my na- 
ture is such I am most impressed by just that which I don’t yet know.” To this harsh 
comment by Hilbert, one may add the more positive words by Klein on Study’s dis- 
sertation: “Dr. Study is of independent, sensitive nature. If he moreover learns to 
submit his subjective impulses to the requirements of given conditions then I hope 
his habilitation will contribute substantially to support mathematical studies at our 
university” (our translation from [6] p. 57). 

Study had an unusual—sometimes awkward—style which we perceive in the pa- 
per that concerns us here. Let us quote Study from an answer he published in the 
Bulletin of the AMS [13] to a critique made by M. Bocher on one of his papers: 
“What I am concerned with is merely the inference which the author leaves to his 
readers. This inference would appear to be that I had been thoughtless, or careless, 
or regardless enough to publish my results in an embryonic state of development. It 
will inevitably be understood that all my theorems are ‘true in general’ only, and con- 
sequently incorrect. [...] My fault (if fault it is) thus consisted in supposing notions 
widely known (and accepted by competent judges) which in reality perhaps were not 
so. Why I held such an opinion is easily explained. The ideas in question are very 
simple. From the standpoint of Klein’s Program of 1872 they appear almost as mat- 
ters of course. To me they have been familiar since about 1884, the time when I was 
a young student under Professor Klein, who in later years was Professor Bécher’s 
teacher also, and to whom also Professor Bécher, as he says himself, is indebted for 
the same ideas. It would seem to follow from this that the public I had in mind was 
likely to exist [...]” 


Let us also quote a citation from a report on an article StudyStudy, Eduard wrote 
while he was a Ph.D. student in Munich, in which he gives a response to a problem 
raised by A. Brill concerning spacial curves of degree 4, and for which he won a 
prize: “In his introduction, the author aims at developing the theory of invariants on 
a geometrical basis and hereby arrives at a—though complicated—geometric inter- 
pretation of the process of symbolic calculation. Even though, concerning the gaps 
in the proofs given in the introduction, one cannot completely agree with the author’s 
opinion about the importance of such investigations, nevertheless one has to praise 
and appreciate the independent spirit and the talent with which the author works for 
his self-chosen goal” (our translation from [6], p. 46) 

Finally, let us mention that Gian-Carlo Rota who edited a reprint of Study’s book 
on ternary forms in 1984 wrote about this book: “the flamboyant German style and 
the scattered remarks, often sparkling and original, make the reading enjoyable.” 

The Ph.D. dissertation by Y. Hartwig [6], entitled Eduard Study (1862—1930)—ein 
mathematischer Mephistopheles im geometrischen Gdartchen, contains other interest- 
ing material on Study, namely correspondence with Klein, Engel and others as well 
as reports on his dissertation, habilitation, etc. 
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1 Introduction 


Most textbooks on classical geometry contain a chapter about conics. There are 
many well-known Euclidean, affine, and projective properties of conics. For broader 
modern presentations we recommend the corresponding chapters in the textbook of 
Berger [2] and two recent books [1] and [13] devoted exclusively to conics. For a de- 
tailed survey of the results and history before the 20" century, see the encyclopedia 
articles [6] and [7]. 

At the same time, one rarely speaks about spherical and hyperbolic conics. How- 
ever, they should not be seen as something exotic. Recall that a projective conic is 
essentially a quadratic cone in IR} (that is, the zero set of a homogeneous quadratic 
polynomial). An affine conic is the intersection of a quadratic cone with an affine 
plane. A Euclidean conic is an affine conic in the presence of a Euclidean metric in 
the plane. 
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Now, a spherical conic is a projective conic in the presence of the canonical metric 
on the space of rays, the distance between two rays being the angle between them. 
In other words, a spherical conic is the intersection of a quadratic cone with the unit 
sphere in IR?. We are interested in the metric properties of this intersection with 
respect to the intrinsic metric of the unit sphere. 

Similarly, a hyperbolic conic is the intersection of a quadratic cone with the upper 
half of the one-sheeted hyperboloid {x9 —x?—x3 = 1} C R?. The Minkowski scalar 
product induces on the hyperboloid a hyperbolic metric. Again, we are interested 
in the metric properties of the curve cut by the quadratic cone on the hyperboloid. 
Central projection from the origin maps the hyperboloid to the disk x? + x} < 1 
in the plane x9 = 1, known as the Beltrami—Cayley—Klein model of the hyperbolic 
plane. The same projection sends a quadratic cone to an affine conic in the plane 
Xo = 1. Thus, in the Beltrami—Cayley—Klein model (which we shall call, for short, 
the Beltrami disk), a hyperbolic conic is simply the intersection of an affine conic 
with the disk depicting the hyperbolic plane. 

On a more formal and coordinate-free level, a spherical or hyperbolic conic is a 
pair of quadratic forms (Q, S) in a 3-dimensional real vector space, such that Q (the 
absolute) is non-degenerate. 

Non-Euclidean conics share many properties with their Euclidean relatives. For 
example, the set of points on the sphere with a constant sum of distances from two 
given points is a spherical ellipse. The same is true in the hyperbolic plane. A reader 
interested in the bifocal properties of other hyperbolic conics can take a look at The- 
orem 6.10 or Figure 37. 

In spherical geometry, polarity associates to a point the great circle centered at that 
point. In hyperbolic geometry, the polarity associates to a point outside the Beltrami 
disk a hyperbolic line polar to that point with respect to the boundary of the disk. 
(Points inside the disk can be viewed as poles of lines disjoint from the disk.) The 
polarity exchanges distances and angles. Because of this, for every metric theorem 
about spherical or hyperbolic conics there is a dual one. For instance, the theorem 
about constant sum of distances from two given points becomes a theorem about the 
envelope of lines cutting triangles of constant area from a given angle. See Figures 14 
and 36. 

The first systematic study of spherical conics was undertaken by Chasles. In the 
first part of [5] he proves dozens of metric properties of quadratic cones by elegant 
synthetic arguments. In the second part he interprets them as statements about spher- 
ical conics. The work of Chasles was translated from French into English and sup- 
plemented by Graves, see also his article [14]. A coordinate approach to spherical 
conics is used in [24] and [13]. Among the works on hyperbolic conics, the article 
of Story [23] can be singled out. It uses with much success pencils of conics and the 
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relation between Q(v, w) and dist(v, w). Story’s article also contains canonical forms 
of equations of hyperbolic conics, for which one can also consult [10] and [11]. 

The present article is to a great extent based on the works of Chasles and Story. 
We were unable to reproduce all of their results; this could have doubled the length 
of the article. The interested reader is referred to the originals. 


2 Quadratic forms, conics, and pencils 
2.1 The dual conic 


Let S be a symmetric bilinear form on a real 3-dimensional vector space V. The 
isotropic cone of the corresponding quadratic form is the set 


Is = {v € V | S(v, v) = Of. 
The form S' defines a linear homomorphism 
Hs:V — V*, (Hs(v),w) = S(v,w). 


The symmetry of S' translates as the self-adjointness of H's with respect to the canon- 
ical pairing {-,-) between V and V*: 


(Hs(v), w) = (v, Hs(w)). 


There are one-to-one correspondences between quadratic forms, symmetric bilinear 
forms, self-adjoint homomorphisms V — V™%*, and (complexified) isotropic cones. 
A projective conic (for brevity, we call it simply conic) is an equivalence class of 
any of the above objects under scaling. By abuse of notation we will often denote a 
conic by the same symbol as the associated symmetric bilinear form; do not forget 
that scaling the form does not change the conic. 

The elements of P(V*) can be interpreted as the lines in P(V). Therefore a conic 
in P(V) is called a point conic, and a conic in P(V*) is called a line conic. 

With every non-degenerate point conic one can associate a line conic. This can 
be done in various ways that turn out to be equivalent: 


* invert the homomorphism Hs; 
* take the image Hs (Js); 


¢ take the set of tangents to the conic. 


Lemma-Definition 2.1. The three constructions listed above result in the same line 
conic, called the dual conic of S and denoted by S*. 
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Proof. The first construction yields the cone 


{f eV" |(Hs'(f), f) = 0}, 


and the second 
{Hs(v) | v € V, (v, Hs(v)) = 0}, 


which is clearly the same. Also, for every v € I's we have 


ker Hg(v) = {w € v | S(v, w) = O},7 


which is the plane tangent to 75 along the line spanned by v. 


Thus for the dual conic S* we have 
Hs» = H5', Is« = Hs(s). 


A degenerate point conic is a pair of lines or a double line, and a degenerate line 
conic is a pair of points or a double point (which means that the conic consists of 
all lines through one of these points, taken twice if the points coincide). The duality 
between non-degenerate point conics and line conics does not extend to degenerate 
ones. The best one can do is to extend the relation Hs 0 Hs» = Id projectively, see 
[21, Section 4.D]. 


2.2 Euclidean vs non-Euclidean geometry 


Let Q be a non-degenerate conic in P(V), hereafter called the absolute. The pro- 
jective transformations from PGL(V) that map (2 to itself preserve the Cayley—Klein 
distance between the points, defined as half the logarithm of their cross-ratio with the 
collinear points on the absolute. They also preserve the Cayley—Klein angles between 
the lines, defined through their cross-ratio with the concurrent tangents to Q (which 
can be interpreted as the cross-ratio of the corresponding points in P(V*) with points 
on the line conic dual to Q). 

For a sign-definite form &2 we obtain the elliptic geometry of P(V); for an indef- 
inite form we obtain hyperbolic (or hyperbolic-de Sitter) geometry. 

Euclidean geometry is the geometry of a degenerate line conic that is positive 
definite and has rank two. Such a conic consists of lines that pass through one of 
two complex conjugate points. The line through these points is real and is fixed 
by the transformations that map the conic to itself. This designates it as the line at 
infinity. A degenerate line conic allows to define the Cayley—Klein angles, but not 
the Cayley—Klein distances. The corresponding transformation group consists of all 
similarity transformations. 
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2.3 Absolute polarity 


A non-degenerate absolute Q corresponds to a self-adjoint isomorphism Hg: V > V* 
Projectivization leads to a well-defined isomorphism between projective planes 


[Ha]: P(V) — P(V"), 


which, together with its inverse, is called absolute polarity. The image of a point 
p € P(V) is called the (absolute) polar of p and denoted by p°. The preimage of a 
line £ € P(V%) is called the (absolute) pole of € and denoted by £°. 


Equivalently, the absolute polar of a linear subspace of V is its orthogonal com- 
plement with respect to Q. 


The points [v],[w] € P(V) such that Q(v, w) = 0 are called conjugate with 
respect to Q. In other words, the polar of a point consists of all points conjugate to it. 


Two lines are called conjugate if (viewed as elements of P(V*)) they are conju- 
gate with respect to the dual quadratic form Q7!. Conjugate lines contain the poles 
of each other. 


Absolute polarity does not change when (2 is scaled, thus it is completely defined 
by the conic Q. The polar of a point lying on the conic is the tangent at that point. 
The fact that absolute polarity preserves incidence between points and lines leads to 
the geometric constructions of poles and polars shown on Figure 1. On this figure, 
the real part of the isotropic cone of (2 is non-empty, that is, the form 2 is indefinite, 
which corresponds to hyperbolic geometry. In spherical geometry, the isotropic cone 
is imaginary. Because of that, a similar geometric construction does not exist, but it 


can be expressed through an algebraic statement. 


Figure 1. Constructing poles and polars 
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2.4 The polar conic 


Let, in addition to an absolute conic 2, another non-degenerate conic S be given. 
Its dual S*, defined in Section 2.1, is a line conic. We can use absolute polarity to 
transform it into a point conic. Again, there are three equivalent descriptions of this 
construction: 


* the homomorphism Hg o Hsx 0 Hg: V > V*; 
* the cone Ho! (Is*) CV; 


* the poles of the tangents to the conic S. 


Lemma-Definition 2.2. The three constructions listed above result in the same conic, 
called the polar conic of S and denoted by S°. 


Proof. The first construction yields the set of all v € V that satisfy 
(Ho(Hs*(Ha(v))), v) = 0. 
Since we have 
(Ho(Hs*(Ha(v))),v) = (Hs*(Ha(v)), Ha(v)), 
this set is the image under Ho ! of the set 
{f eV" | (Hs«(f), f) = 0} = Iss. 


Hence the second construction is equivalent to the first. 
The third construction is equivalent to the second, because the set Hs (5) consists 
of the tangents to the conic S, and the map Hp ! represents absolute polarity. 


As a consequence, absolute polarity transforms conjugacy with respect to S to 
conjugacy with respect to S°. 


Corollary 2.3. A line £ is the polar of a point p with respect to a conic S if and only 
if the point £° is the pole of the line p° with respect to the conic S°. 


Proof. The diagram 
Pvt) 8%) PV) 
| ual | 


pv) 2h pcys) 


is commutative, since Hy> = Hg o Hs o Hg by Lemma-Definition 2.2. 
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2.5 Pencils of conics 


Since quadratic forms on a three-dimensional vector space form a vector space of 
dimension six, projective conics form a five-dimensional projective space. 


Definition 2.4. A pencil of conics is a line in the projective space of conics. 


A pencil spanned by the conics S and T consists of conics of the form AP + “Q. 
The degeneracy condition 
det(AP + ~Q) =0 


is a homogeneous polynomial of degree 3 in A and jz. Therefore either all or at most 
three conics of a pencil are degenerate. 


Example 2.5 (conics through four points). Consider a pencil that contains two pairs 
of lines. If a line of one pair coincides with a line of the other pair, then this line is 
contained in all conics of the pencil, hence all conics are degenerate. If all four lines 
are distinct, then the first pair intersects the second pair in four points, and the pencil 
consists of the conics through these four points, see Figure 2, left. In particular, it 
contains a third degenerate conic made of a third pair of lines. 


Example 2.6 (double contact pencil). Consider a pencil that contains a pair of lines 
£,, & and a double line through the points py € €; and pz € £» (assuming that 
neither p; nor p2 is the intersection point of £; and £2). Then every conic of the 
pencil goes through the points p; and p2 and is tangent to the lines £; and £2 at these 
points (the latter can be seen by viewing the double line as a limit of pairs of lines). 
See Figure 2, right. 


Figure 2. Pencil of conics through four real points and a real double contact pencil 
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There are pencils that involve imaginary elements, for example the pencil of con- 
ics through two pairs of complex conjugate points. 

For more details, including other types of pencils with illustrations, see [1, 3.3], 
[2, 16.4], and [13, 7.3] 


2.6 Dual pencils and confocal conics 


Take a pencil P of line conics, that is, a line in the space of conics in P(V*). The 
non-degenerate conics of P can be dualized (see Section 2.1). This gives a family of 
point conics, called a dual pencil or a tangential pencil. 


Example 2.7. Let P be a pencil of line conics through four lines in general position. 
The corresponding dual pencil consists of non-degenerate point conics tangent to 
those lines. The degenerate members of are three pairs of interection points of the 
four lines. Thus, in a sense, the dual pencil is spanned by the three diagonals of a 
complete quadrilateral. Intuitively, these diagonals are degenerate ellipses tangent to 
the four lines. 


Example 2.8. Let ? be a double contact pencil. The dual pencil is also (the non- 
degenerate part of) a double contact pencil: since the dual conic is made of tangents, 
the dual of a conic tangent to @ at p is a line conic “tangent to” p at £. The degenerate 
members of this dual pencil are a pair of points (the points of contact) and a double 
point (the intersection point of the lines of contact). 


Definition 2.9. If a pencil of line conics contains the dual absolute, then the corre- 
sponding dual pencil is called a confocal family of conics. 


Recall that in Euclidean geometry there is no absolute point conic, but there is 
an absolute line conic, see Section 2.2. Therefore Definition 2.9 also makes sense in 
Euclidean geometry. 

We define the foci of spherical and hyperbolic conics in Sections 3.3 and 5.3 and 
discuss confocal families in Sections 3.5 and 5.7 in more detail. 


Lemma 2.10. Confocal conics intersect orthogonally. 


Proof. This statement means that the tangents of two confocal conics at their inter- 
section point are conjugate with respect to the absolute. Equivalently (in terms of 
line conics), the points of tangency of a common tangent to two conics of a pencil 
are conjugate with respect to any conic from this pencil, see Figure 3. But since these 
points are conjugate with respect to the conics to which they belong, they are so with 
respect to any linear combination. 
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Figure 3. Confocal conics intersect orthogonally, and the dual statement 


Example 2.11. A special case of confocal conics is the dual of the pencil spanned 
by the dual absolute and a double point 2p. If p does not lie on the absolute, then 
this is a double contact pencil spanned by the tangents from p to Q and the double 
polar of p. This pencil is made by the circles centered at p, which can be shown by a 
simple computation. (In the hyperbolic case, p can be hyperbolic, de Sitter, or ideal 
point, see Lemma 6.3.) 


2.7 Chasles’ theorems 


Chasles’ article [4] contains a variety of theorems, whose proofs follow all the same 
principle. A pencil of conics is a line in the projective space of all conics. If in a 
collection of lines there are many concurrent pairs, then all lines are coplanar, and 
hence any two of them are concurrent. Translated to pencils of conics, this means that 
if in a collection of pencils many pairs share a conic, then any two of these pencils 
share a conic. The argument also works for dual pencils, since they correspond to 
lines in the space of line conics, and in particular for confocal families of conics. 


The following theorem is contained in [4]. 
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Theorem 2.12 (Chasles). Let {€;}4_, be four tangents to a conic A. Denote by pij 
the intersection point of ¢; and €;. Assume that the points pi and p34 lie on a conic 
B confocal to A. Then the following holds. 


1. The pairs of points p13, p24 and p14, p23 also lie on conics confocal to A. 


2. The tangents at the points pj; to the three conics that contain pairs of points meet 
at the same point q. 


3. There is a circle tangent to the lines £;, and this circle is centered at q. 


If, in the hyperbolic case, the point q is ideal or de Sitter, then the role of a circle 
centered at q is played by a horocycle or a hypercycle. 


Figure 4. Chasles’ theorem 
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Proof. The idea is to dualize the picture, so that confocal conics become conics 
collinear with the dual absolute, see Section 2.6. 


Consider two pencils of line conics: one spanned by A* and the dual absolute, 
the other made by conics through the lines £; (duals of conics tangent to these lines). 
The second pencil contains the conics A*, py2 + p34, P13 + Pra, and pig + p23 
(the latter three line conics are degenerate, see the last paragraph of Section 2.1). 
Since the two pencils share the conic A*, they lie in a plane in the projective space 
of line conics. 

Consider the pencil of line conics spanned by B* and pj. + p34. This is a double 
contact pencil; it contains the double point 2q¢, where q is the intersection point of 
the tangents to B at p12 and p34. See Figure 5, where the line conics are depicted as 
points. 


Pi3 + pra Pia + P23 


Figure 5. Proof of the Chasles theorem 


The pencil of line conics spanned by pi4 + p23 and 2q and the pencil spanned 
by A* and Q* have a conic in common. Its dual is a point conic confocal with A and 
tangent at the points p14 and po3 to the lines pi4q and p23q. The same is true with 
P13 + p2a in place of p14 + p23, which proves the first two parts of the theorem. 

For the third part, consider the pencil spanned by Q* and 2¢. It consists of circles 
centered at g (for the hyperbolic case, see Lemma 6.3). This pencil intersects the 
pencil of conics through the lines £;, hence there is a circle centered at g tangent to 
those lines. 
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The third part of the theorem is related to the following property of confocal con- 
ics: a billiard trajectory inside a conic is tangent to a confocal conic. 


2.8 Projective properties 


The theorems of Pascal, Brianchon, and Poncelet deal with projective properties of 
conics, therefore they hold for non-Euclidean conics as well as for Euclidean ones. 
However, in the non-Euclidean case one can obtain new theorems from known ones 
by modifying them as follows. 


¢ Apply absolute polarity to some of the elements. 


¢ If there is one or several conics in the premises of the theorem, assume one of 
them to be the absolute conic. 


¢ If there are several conics, assume two of them to be polar to each other. 


The theorems below illustrate this. 


Theorem 2.13. The common perpendiculars to the pairs of opposite sides of a spher- 
ical or hyperbolic hexagon intersect in a point if and only if the hexagon is inscribed 
in a conic. 


Proof. The common perpendicular to two lines £; and £2 is the line through the 
poles 5 and £5. A hexagon is inscribed in a conic if and only if its polar dual is 
circumscribed about a conic. Hence this theorem follows from the Brianchon theorem 
by applying absolute polarity. 


Theorem 2.14. The diagonals and the common perpendiculars to the opposite pairs 
of sides in an ideal hyperbolic quadrilateral meet at a point. 


Proof. The common perpendiculars are the diagonals of the polar quadrilateral, which 
is circumscribed about the absolute. The diagonals and the lines joining the opposite 
points of tangency meet at a point; this is a limiting case of the Brianchon theo- 
rem. 


Theorem 2.15. The set of points from which a (Euclidean or non-Euclidean) conic 
is seen under a right angle is also a conic. 
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Proof. The tangents drawn from a point to the conic are orthogonal if and only if 
they harmonically separate the tangents to the absolute conic. This set of points is 
called the harmonic locus of the conic and the absolute. The harmonic locus of any 
two conics is a conic; this can be proved with the help of Chasles’ theory of (2 — 2) 
correspondences, see [8, Section 50]. 


The set of points from which a curve is seen under a right angle is called the 
orthoptic curve. In the Euclidean case, the orthoptic curve of a conic is a circle. 


Theorem 2.16. Choose any tangent £, to a non-Euclidean conic S. Let ¢ be a 
tangent to S perpendicular to €,, let £3 be a tangent to S perpendicular to £2 and 
different from £1, and so on. Assume that for some n we have €n41 = €1. Then the 
same holds for any other choice of the tangent €). 


First proof. This is the Poncelet theorem for the conic S and its orthoptic conic. 


Second proof. The intersection point of the lines £; and £3 is the pole of £2, which 
lies on S°. Hence £1, £3, €5,...is a Poncelet sequence for the conic S and its polar 
S°, see Figure 6, left. By assumption, for 7 odd it closes after n steps, hence it closes 
after 1 steps for any choice of £;. For n even, we have two sequences that close after 
n 

5 steps. 


lA 
NIA 


Zz 
2 


Figure 6. A non-Euclidean variation of the Poncelet theorem 
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Corollary 2.17. A spherical ellipse inscribed in a regular right-angled triangle can 
be rotated while staying inscribed in that triangle. See Figure 6, right. 


Dually, a spherical ellipse circumscribed about a regular right-angled triangle can 
be rotated while staying circumscribed. See also [13, Theorem 10.1.11]. 


3 Spherical conics 
3.1 A spherical conic and its polar 


Algebraically, a spherical conic is a pair of quadratic forms (Q, S) on a 3-dimensional 
vector space, with Q positive definite. In an appropriate basis we have 


O(v,v) =x" +? +2’, 


Geometrically, a spherical conic is an intersection of the unit sphere in R? with a 
quadratic cone. If the cone is degenerate, then the conic consists of two great circles 
or of one “double” great circle. If the cone is circular, then the conic is a pair of 
diametrically opposite small circles. The most interesting case is when the cone is 
non-degenerate and non-circular, which we always assume in the sequel. 

By the principal axes theorem, there is a basis in which Q keeps its form, while 
the quadratic form defining the cone becomes diagonal: 


52 a. ge 
C={veER?|SQ,v)=0}, Sv =S+5-5, a>db. (1) 
a 
The polar conic is then given by 


C° = {v Ee R? | S°(v,v) = 0}, S°(v, v) = a?x? +b? y? —c?z7. (2) 


Following Chasles, we call the z-axis the principal axis, the x-axis the major 
axis, and the y-axis the minor axis of C. The polar cone has the same principal axis, 
but the major and the minor axes become interchanged. The points where the axes 
intersect the sphere are called the centers of the conic. 


3.2 Projections of a spherical conic 


A spherical conic is a spatial curve of degree 4, but from a carefully chosen point of 
view it takes a more recognizable shape. 


Spherical and hyperbolic conics 277 
Theorem 3.1. The orthogonal projection of a spherical conic along the principal axis 


is an ellipse. The projection along the major axis consists of two arcs of a hyperbola, 
and the projection along the minor axis consists of two arcs of an ellipse. See Figure 7. 


Zz Zz y 


Figure 7. Orthogonal projections of a spherical conic along the axes 


Proof. The pencil of affine quadrics spanned by Q = 1 and S = O contains the 
following three cylinders: 


1 1), ta 
(ata) ea ra)ra- 
i. AAs 1 
Ga) ie 

1 


Byes Geog 


Each of these cylinders intersects the sphere along the conic S. Hence the projection 


| 
| = 


| 
| - re 
So 
< 
IN) 
| 
S| 


of the conic along the axis of a cylinder is the part of an orthogonal section of the 
cylinder that lies inside the unit disk. 


3.3 Foci and focal lines of a spherical conic 


Every quadratic cone has a circular section, a fact that was established by Descartes. 
Any two parallel sections (not passing through the origin) are similar. 


Definition 3.2. A cyclic plane of a quadratic cone is a plane through the origin such 
that every parallel plane intersects the cone in a circle. The intersection of a cyclic 
plane with the unit sphere is a great circle called a focal line of the spherical conic. 


A non-circular cone has two cyclic planes, both passing through the major axis 
and symmetric to each other with respect to the plane spanned by the principal and 
the major axes. 
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Definition 3.3. A cocyclic line of a quadratic cone is the orthogonal complement of 
the cyclic plane of the polar cone. The intersection of a cocyclic line with the sphere 
is an antipodal pair of points called a focus of the spherical conic. 


A non-circular cone has two cocyclic lines, both of which lie in the plane of the 
principal and the major axes. Accordingly, a spherical conic has two foci. 


Theorem 3.4. The cyclic planes of the cone C from (1) are given by the equation 


1 1 1 1 


The cocyclic lines of the cone C from (1) are spanned by the vectors 
(fa? — b+, 0, dvb? +e). 


Proof. The pencil of quadrics S + AQ contains a pair of planes 


1 1 l\ 5 1 l\ 5 
S-39=(5-a)”- (ata). 
Hence the restriction of S' to each of the planes is proportional to the restriction of 
(2. This implies that the sections of C parallel to these planes are circular. 
The formulas for the cocyclic lines of C follow by computing the cyclic planes of 


C° and applying polarity (remember that polarity exchanges the major and the minor 
axes). 


Corollary 3.5. For every spherical conic, the intersection of its cyclic planes with 
the plane of the principal and minor axes are the asymptotes of the hyperbola that 
contains the projection of the conic to that plane. See Figure 7, left. 


Let us now describe some geometric properties of cocyclic lines. 


Theorem 3.6. Two planes through a cocyclic line of a quadratic cone are orthogonal 
if and only if they are conjugate with respect to the cone. 


Proof. By Corollary 2.3, orthogonality with respect to S translates into orthogonality 
of the polars with respect to S°. Polars of planes through a cocyclic line of C are 
lines in a cyclic plane of C°. In this plane, the forms S° and Q are proportional to 
each other; in particular, two lines are orthogonal with respect to S if and only if they 
are orthogonal with respect to Q. 
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cocyclic lines 


cyclic planes 


Figure 8. Planes parallel to cyclic planes intersect a cone along a circle. Planes through cocyclic 
lines are orthogonal if and only if they are conjugate. 


Corollary 3.7. The plane tangent to $* at a focus F of a spherical conic intersects 
the corresponding cone along an ellipse with a focus at F. 


Proof. Let x be the tangent plane to $? at F. Orthogonal planes through a focal line 
intersect 2 along orthogonal lines. Thus by the previous theorem two lines in zr that 
pass through F are orthogonal if and only if they are conjugate with respect to the 
ellipse C M x. This property characterizes the foci of a Euclidean ellipse, see [2, 
Section 17.2.1.6]. 


The following lemma about circular sections will be particularly useful. 


Lemma 3.8. Every sphere passing through a circular section of a quadratic cone 
intersects it along another circle. Conversely, any two non-parallel circular sections 
of a quadratic cone are contained in a sphere. 


Proof. Consider the inversion with center at the apex of the cone that sends the sphere 
to itself. Since the inversion also fixes the cone, it exchanges the two components of 
the intersection. Therefore, if one component is a circle, so is the other. 

For the second part, find an inversion that sends one circular section to the other. 
Then an invariant sphere passing through one of the circles passes also through the 
other. 
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In the limit, as one of the circular sections tends to a point, we obtain the follow- 
ing. 


Corollary 3.9. Every sphere tangent to a cyclic plane at the apex of the cone inter- 
sects the cone in a circle. 


3.4 Directors and directrices 


Definition 3.10. A line through the origin conjugate with respect to a quadratic cone 
to one of its cyclic planes is called a director line of the cone. The intersection of a 
director line with the sphere (that is, the pair of poles of a focal line with respect to 
the conic) is called a director of a spherical conic. 


In other words, the director lines are formed by the centers of circular sections of 
the cone. 


Definition 3.11. A plane through the origin conjugate with respect to a quadratic cone 
to one of its cocyclic lines is called a directrix plane of the cone. The intersection of 
a directrix plane with the sphere (that is, the polar of a focus with respect to a conic) 
is called a directrix of a spherical conic. 


By Corollary 2.3, the directrix of a conic is polar to the director point of the polar 
conic. 

From the formulas of Theorem 3.4 it follows that the director lines of the cone C 
are spanned by the vectors 


(0. by1— 7 +eV1+ =), 


and its directrix planes have the equations 


Va — b? Nee _ 
ar a a 


0. 


3.5 Families of spherical conics 


A spherical conic S' intersects the absolute conic Q in four imaginary points that form 
two complex conjugate pairs (the points are distinct since we assumed the cone to be 
non-circular). 
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Theorem 3.12. The focal lines of a spherical conic are the lines through the complex 
conjugate pairs of intersection points of the conic with the absolute. 


Proof. This follows from the computation in the proof of Theorem 3.4. But here is 
an alternative coordinate-free argument. A cyclic plane of a quadratic cone is a plane 
on which the restrictions of Q2 and S are proportional. In particular, Q2 and S' must 
vanish at the same time. This implies that a cyclic plane is spanned by two common 
(imaginary) isotropic lines of Q and S'. Projectively, a cyclic line is a line through two 
intersection points of Q and S. For this line to be real, the points must be complex 
conjugate. 


As aconsequence, conics that share the focal lines with the conic S form a pencil 
of conics through four imaginary points, spanned by (2 and S. In coordinates, this 
pencil is given by the equations 

1 1) 2 1 a) y2 1 ‘= 
(sa-A)x? + (Ga-A)o?- (ara)? =o. GB) 


One can visualize the conics with common focal lines as follows. 


Theorem 3.13. Every hyperboloid asymptotic to the cone over a spherical conic S 
intersects the sphere along a spherical conic that shares the focal lines with S. 


Proof. A hyperboloid asymptotic to the cone over S is given by an equation of the 


form 
2 2 2 


x y Zz 

ete 
By taking a linear combination with the equation of the sphere, we see that the hy- 
perboloid intersects the sphere along the same curve as the cone (3). 


By definition of the foci and directrices, we have the following. 


Theorem 3.14. A focus of a spherical conic is the intersection point of two (com- 
plex conjugate) common tangents to the conic and the absolute. The corresponding 
directrix is the line through the points where these tangents touch the conic. 


That is, conics confocal with S form a dual pencil to the one spanned by S° and 
Q2 and are given by the equations 


x2 — 72 
i a a a os ar 


Corollary 3.15. Conics that share a focus and a corresponding directrix form a 


0, AS CLA 706727), (4) 


double contact pencil (determined by two complex conjugate points on two complex 
conjugate lines). 
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4 Theorems about spherical conics 
4.1 Ivory’s lemma 


On the turn of the 18" century, Laplace and Ivory [15] computed the gravitational 
field created by a solid homogeneous ellipsoid. Ivory’s solution was immediately 
and widely recognized for its elegance, see [27], Sections 1141 and 1146. Chasles [4] 
further simplified Ivory’s argument; it is under this form that it is presented nowadays, 
see e.g. [12], Lecture 30. In particular, Chasles stated explicitely what is now called 
the Ivory lemma. 

Ivory’s lemma is a statement about confocal conics and quadrics. Its original ver- 
sion deals with confocal quadrics in Euclidean 3-space, but it holds in any dimension 
with respect to any Cayley—Klein metric, see [22]. 

In this section we prove Ivory’s lemma on the 2-sphere. 


Theorem 4.1 (Ivory’s lemma). The diagonals in a quadrilateral formed by four con- 
focal spherical conics have equal lengths. 


Figure 9 illustrates Ivory’s lemma in the Euclidean plane. It can also be interpreted 
as a distorted view of confocal spherical conics. 


Figure 9. Ivory’s lemma 


Take our quadratic cone C and a cone C, given by equation (4) with —1 <A <b?, 
which ensures that the corresponding confocal conics do not intersect. Consider a 
linear map R? > R? given by the diagonal matrix 


Vaz—. Vb2-—2 —) 
= ,—_—_]. 


F, = diag ( ; 
Cc 
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Lemma 4.2. The map F), sends the spherical conic S = C M $* to the conic Sj, = 


C, 1S. Besides, if a point v € S belongs to a conic Sy, then F,(v) also belongs 
to Sy. 


Proof. Letus set A = diag(a’, b?, —c?). Then C, Cy, and C,, are the isotropic cones 
of the matrices 


S=A', S,=(A-Ald)?, S,=(A-pld)}, 
respectively. By definition of Fy, we have 
FR =(A-AMA, 
which immediately implies F,(C) = C). Further, observe that 
|Fa(v) ||? = 0 (A—Ald) Av = v v—Av' A! = lull? —Av' Sv = 1, 


provided that v € $2 C. Hence, v € S implies v € S). 
We also have 
Fy SF, = A7\(A—Ald)(A— wld)! 
=(A-— poy! =1A 1 (A= play! 


(4— wld)! — 24 — pty — 2-4 
bb 
a i 
ey er (1 = “\(4 =ra\ 
in in 
In other words, 


Xr Xr 
Te aT Net 
Fy(v) Sy Fa(v) = rr sv +(1 “° Sv, 


so that v € CM C,, implies v € Cy. 
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Proof of Theorem 4.1. Let S, S,, Sy,, and S,,, be confocal conics such that S' and 
S, are disjoint, and S,,,, S,,. intersect them. Let vj ¢ SM S,; and wi € S49 Sy;, 
i = 1,2. We want to show that dist(v,, w2) = dist(v2, w}). 

By Lemma 4.2, w; = Fy(v;). Since the matrix Fy is diagonal, we have 


v} w2 = 0] Fyv2 = wy] v2, 


which implies the desired equality of diagonals. 


Remark 4.3. The most general form of the Ivory lemma was discovered by Blaschke 
(see [3]): it holds in the coordinate nets of Stackel metrics (in dimension 2 known 
as Liouville nets). Conversely, if a coordinate net on a Riemannian manifold has 
the equal diagonals property, then it is a Stackel net. The argument uses integrals of 
the geodesic flow and is inspired by Jacobi’s ideas from his study of geodesics on 
ellipsoids. For a modern exposition, see [25] and [16]. 


4.2 Bifocal properties of spherical conics 


On the illustrations to the following theorems we are using the projective model of 
the sphere: the projection of the sphere from its center to a tangent plane. A spherical 
conic becomes an affine conic, all great circles (in particular, the focal lines of the 
conic) become straight lines. As a projection plane, it is convenient to choose a plane 
that is tangent at one of the centers of the conic, see Figure 10. 


Figure 10. Projections of a spherical conic to the planes tangent to its centers 


The focal lines in Figure 10, middle, are not the asymptotes of the hyperbola; the 
arrows in the center of Figure 10, right, indicate the position of the foci, which belong 
to the line at infinity. 

Each of the theorems below consists of two parts, dual to each other via absolute 
polarity. Therefore we are proving only one part of each theorem. Usually this is 
the part that deals with the focal lines, because in the space it deals with circles and 
spheres and can be proved by an elegant synthetic argument. All proofs were given 
by Chasles in [5]. 
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Theorem 4.4. 1. For every tangent to a spherical conic, the point of tangency bisects 
the segment comprised between the focal lines. 


2. The lines joining the foci of a spherical conic with a point on the conic form 
equal angles with the tangent at that point. 


YY G2 


Figure 11. The bisector property 


Proof. Interpreted in terms of the corresponding quadratic cone, the first statement 
becomes as follows: any plane tangent to the cone intersects the cyclic planes along 
the lines that make equal angles with the line of tangency. 

Take two planes parallel to the cyclic planes; they intersect the cone in two non- 
parallel circles. By Lemma 3.8, these circular sections are contained in a sphere. 
A tangent plane to the cone intersects the planes of the circles in two lines tangent to 
the sphere, see Figure 13, left. We need to show that these lines make equal angles 
with the line of tangency. But any two tangents to the sphere make equal angles with 


the segment joining their points of tangency, and the theorem is proved. 


Theorem 4.4 can be viewed as a limiting case of the following. 


Theorem 4.5. 1. For every secant line of a spherical conic the segments comprised 
between the conic and the focal lines have equal lengths. 


2. For every point outside a spherical conic, the angle between a tangent through 
this point and the line joining the point to a focus is equal to the angle between the 
other tangent and the line to the other focus. 


OF G&D 


Figure 12. The generalized bisector property 


286 I. Izmestiev 


Proof. The first statement says that every plane through two generatrices of the cone 
intersects the cyclic plane along two lines such that the angle between a line and a 
generatrix is equal to the angle between the other line and the other generatrix. 
Translate the cyclic planes so that they intersect the cone along two circles. By 
Lemma 3.8 these two circles are contained in a sphere. The generatrices and the 
intersection lines of the secant plane with the planes spanned by the circles form a 
planar quadrilateral inscribed in the sphere, see Figure 13, right. Two opposite angles 


of this quadrilateral complement each other to 2, which implies the theorem. 


nd 


Figure 13. Proofs of Theorems 4.4 and 4.5 


Theorem 4.6. 1. A tangent to a spherical conic cuts from the lune formed by the 
focal lines a triangle of constant area. 


2. The sum of the distances from a point on a spherical conic to its foci is constant. 


The first part is dual to the second, because the area of a spherical triangle is equal 
to its angle sum minus zr. Since the angle at the vertex F is constant, the statement 
is equivalent to the constancy of the sum of the angles at X and Y. These angles are 
equal to the lengths of F Z and FZ in the second part of the theorem. 


Figure 14. Bifocal properties: Area(A EX Y) = const., Fy Z + FZ = const. 
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The second part is due to Magnus [18], who proved it by a direct computation. 
Two different proofs are given below: the first one uses differentiation, the second 
one synthetic geometry. The latter is due to Chasles and uses Theorem 4.7 below. 


First proof. When we move a line keeping it tangent to the conic, the instantaneous 
change in the area of the triangle is zero because, by the first part of Theorem 4.4, the 
point of tangency bisects the segment XY. Similarly, for a point moving along the 
conic, the derivative of the sum of its distances from the foci is zero by the second 
part of Theorem 4.4. 


Second proof. The second part of Theorem 4.6 can be derived from the second part 
of Theorem 4.7 similarly to the proof that in a circumscribed quadrilateral the sums 
of opposite pairs of sides are equal. Take two points A and B on a conic such that 
the segments F) A and FB intersect. By Theorem 4.7 there is a circle tangent to the 
lines FA, F, B, F2 A, F)B. Since tangent segments drawn from a point to a circle 
have equal lengths, we have (see Figure 15) 


FiA+PMhA=FPjK+RL=Ea PM+P)N=F, B+ FOB. 


Figure 15. Second proof of Theorem 4.6 


Theorem 4.7. 1. Two tangents to a spherical conic intersect the focal lines in four 
points that are equidistant from the line through the points of tangency. 


2. Four lines joining the foci of a spherical conic with two points on the conic are 
tangent to a circle. The center of this circle is the pole of the line through the two 
points on the conic. 
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Figure 16. Illustration to Theorem 4.7 


Proof. In R°, the first statement says that for any two tangent planes to the cone their 
intersection lines with the cyclic planes form equal angles with the plane through 
the lines of tangency. Theorem 4.4 implies this for the intersection lines of the same 
tangent plane with different cyclic planes. Let us prove this for the intersection lines 
of different tangent planes with the same cyclic plane. Figure 17 shows two such 
lines p; and p2. The shaded triangle lies in the plane L spanned by the tangent lines, 
compare Figure 16, left. 


f! 


Figure 17. Proof of Theorem 4.7 


Translate the cyclic plane parallelly; it will intersect the cone along a circle, the 
tangent planes along lines p{ and p; parallel to p; and pz, and the plane L along 
a line ¢’. The lines p and p make equal angles with the line ¢’, hence they make 
equal angles with any plane through this line, in particular with L. The theorem is 
proved. 
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The second part of Theorem 4.7 is the third part of Theorem 2.12. 


Theorem 4.8. 1. The product of the sines of distances from the points on a spherical 
conic to the focal lines is constant. 
2. The product of the sines of distances from the foci of a spherical conic to its 


tangents is constant. 


Cp 


Figure 18. sina - sinb = const. 


Proof. Take two non-parallel circular sections of the cone. By Lemma 3.8, there 
is a sphere through these two sections. Therefore for every generatrix of the cone 
the product of lengths of the segments between the apex and the circular sections is 


constant: 


OX - OY =const., 


see Figure 19. On the other hand, we have 


OA 
OX = —., 
sina 
OB 
OY = — ., 
sinb 


where OA and OB are the distances from the apex to the chosen planes, and a, b 
are the angles between the generatrix and those planes, that is, the distances from the 
point corresponding to the generatrix to the focal lines of the conic. Since OA and 
OB are constant, the theorem follows. 
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Figure 19. Proof of Theorem 4.8 


4.3 The focus-directrix property 


Recall that for a point on a Euclidean conic its distance from a directrix is in a constant 
ratio to its distance from the corresponding focus. The following theorem provides a 
spherical analog. 


Theorem 4.9. 1. For a tangent to a spherical conic, the sine of its distance from a 
director point is in a constant ratio to the sine of the angle it makes with the corre- 
sponding focal line. 


2. For a point on a spherical conic, the sine of its distance to a directrix is in a 
constant ratio to the sine of its distance from the corresponding focus. 


& WO 


Figure 20. The focus-directrix property and its dual 
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Proof. Recall that a director point is the intersection point of the sphere with a line 
formed by the centers of circular sections of a quadratic cone. Take a circular section 
of the cone. The distance from its center to a plane tangent to the cone is equal to 
r sina, where r is the radius of the circle, and a is the angle between the plane of the 
circle and the tangent plane, see Figure 21. On the other hand, the same distance is 
equal to £ sin b, where is the length of the segment joining the center of the circle to 
the apex of the cone, and 5 is the angle between this segment and the tangent plane. 


Thus we have 

sina £ 
- = — = const. 
sinb r 


At the same time, a is the angle made by the tangent and a focal line, and 5 is the 
distance from the corresponding director to that tangent. The theorem is proved. 


Figure 21. Proof of Theorem 4.9 


4.4 Special spherical conics 


All spherical conics look essentially the same. However, in certain respects some of 
them are special. 


Theorem 4.10. 1. The locus of the points from which a spherical arc is seen under 
a right angle is a spherical conic. The endpoints of the arc belong to the conic and 
are the poles of its cyclic planes. 


2. An arc of length = with endpoints moving along two given great circles is 
tangent to a spherical conic whose foci are the poles of these great circles. 
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NIA 


Figure 22. Special spherical conics 


Proof. The first statement translates as follows. Choose two lines p; and p2 through 
the origin and let two planes rotate around these lines while being perpendicular to 
each other. Then their intersection line describes a quadratic cone whose cyclic planes 
are orthogonal to p,; and po. 


Draw a plane z perpendicular to the line p;. Planes through p; and po2 are per- 
pendicular if and only if their lines of intersection with z are, see Figure 23. Hence 
the intersection line of these planes describes a cone over a circle with the segment 
[pi 7, p21 2] as a diameter. This is a quadratic cone, and the plane z is parallel 
to one of its cyclic planes. The theorem is proved. 


P2 


Figure 23. Proof of Theorem 4.9 
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Lemma 4.11. The spherical conics from the first part of Theorem 4.10 are given by 
the equations 


x2 -y2 7? 
7 Ta OD = 0, a>b 
a b c 
with od = + + sr. The spherical conics from the second part of the same theorem 


satisfy a2 = b? + c?. 


Proof. By polarity, the statements of the lemma are equivalent. Any one of them can 
be proved with the help of the formulas from Theorem 3.4. 


Another special class of spherical conics is formed by those for which the distance 
to a focus is equal to the distance to the corresponding directrix. In this respect, they 
are similar to the Euclidean parabolas. 


Theorem 4.12. The locus of points on the sphere at equal distances from a point and 
a great circle is a spherical conic with each component of diameter >. Equation (1) 
of this conic satisfies a = c. 


Proof. That this curve is a component of a spherical conic follows from (the inverse 
of) the second part of Theorem 4.9 (the focus-directrix property). The two most 
distant points on the curve are the midpoints of the perpendiculars from the point to 
the great circle. 

For a general conic (1), the diameter of a component is equal to the angle between 
the rays spanned by the vectors (a,0,c) and (—a,0,c). This angle is equal to > if 
and only ifa =c. 


Alternatively, one can derive the last theorem from (the inverse of) the second 
part of Theorem 4.6. For a point F and a great circle d, the condition dist(x, F) = 
dist(x, d) is equivalent to dist(x, F) + dist(x,d°) = i where d° is the pole of d 
lying in the same hemisphere with respect to d as F. Thus the “spherical parabolas” 
are also characterized by their foci being the poles of their directrices (the directrix 
corresponding to a focus must be the pole of the other focus). 

For other special spherical conics see [13]. 


5 Hyperbolic conics 
5.1 The hyperbolic-de Sitter plane 


Let Q be a quadratic form on R? of signature (-,+, +). The hyperbolic plane 
is a component of the hyperboloid of two sheets Q(x,x) = —1, equipped with a 


294 I. Izmestiev 


Riemannian metric induced by the form Q. In the Beltrami—Cayley—Klein model, 
the hyperboloid is projected from the origin to an affine plane and becomes the in- 
terior region of a conic, the absolute conic. Points on the absolute are called ideal 
or absolute points. Geodesics in the Beltrami—Cayley—Klein model are straight line 
segments with ideal endpoints. The geodesic distance between two points is half the 
logarithm of their cross-ratio with the collinear ideal points. 


It is convenient to view the plane of the Beltrami—Cayley—Klein model as a pro- 
jective plane; it is then nothing else but the projectivization of R?. On pictures, we 
show an affine chart of this plane, with the absolute in the form of a circle. 


The exterior of the absolute, homeomorphic to an open Mobius band, is called the 
de Sitter plane. Polarity with respect to the absolute conic sends hyperbolic points 
to de Sitter lines (projective lines disjoint from the absolute), ideal points to lines 
tangent to the absolute, and de Sitter points to hyperbolic lines. 


For more details on the de Sitter geometry, see [9]. 


5.2 Classification of hyperbolic conics 


Algebraically, a hyperbolic conic is a pair of quadratic forms (Q, Q) in R?, where 
the absolute form Q has signature (-,+,-+). We assume (Q to be indefinite (thus 
with non-empty isotropic cone) and usually non-degenerate, so that without loss of 
generality it has signature (—, +, +) as well. 


Geometrically, in the Beltrami—Cayley—Klein model, a hyperbolic conic is the 
part of an affine conic inside the absolute. The part outside the absolute may be 
called a de Sitter conic. However, it is convenient to consider both parts at the same 
time. Under absolute polarity, the hyperbolic (respectively, de Sitter) points of a 
conic correspond to the tangents at the de Sitter (respectively, hyperbolic) points of 
the polar conic. 


For a pair of indefinite quadratic forms the principal axes theorem in general does 
not hold. A geometric manifestation of this is the variety of different relative positions 
of two real conics, and hence the variety of different types of hyperbolic conics. 
Following Klein [17], we classify hyperbolic conics according to the multiplicity 
and the reality of their ideal points. 
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Definition 5.1. An intersection point of a hyperbolic conic with the absolute Q = 0 
is called an absolute point of the conic. A common tangent to the conic and the 
absolute is called an absolute tangent to the conic. 


There are four absolute points and four absolute tangents, counted with multiplic- 
ity and including imaginary elements. Imaginary points or lines come in conjugate 
pairs. 


Non-degenerate hyperbolic conics are subdivided into ellipses, hyperbolas, 
parabolas, and cycles. Ellipses and hyperbolas (see Figure 24) have four distinct 
absolute points. Parabolas (see Figure 25) have at least one simple and at least one 
multiple absolute point. Finally, cycles (see Figure 26) have either two double ore 
one quadruple absolute point. 


Conics occupying the same column on Figures 24—26 are dual to each other, with 
the exception of concave hyperbolas and semihyperbolas, which are self-dual. An- 
other self-dual class of conics are osculating parabolas. 


Ellipse Convex hyperbola Concave hyperbola 


ee 


aaa 


De Sitter Ellipse De Sitter hyperbola Semihyperbola 


Figure 24. Hyperbolic ellipses and hyperbolas 
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Sc ec 


Long concave 
hyperbolic parabola 


Osculating 
parabola 


Wide concave 
hyperbolic parabola 


Convex hyperbolic parabola Elliptic parabola 


Small de Sitter parabola Elliptic parabola 


Figure 25. Hyperbolic parabolas 


Oo Ss 


Circle Horocycle Hypercycle 
De Sitter circle De Sitter horocycle De Sitter hypercycle 


Figure 26. Hyperbolic circles 
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5.3 Foci and focal lines of a hyperbolic conic 


Definition 5.2. A focal line of a conic is a line through two of its absolute points. 
A focus of a conic is an intersection point of two of its absolute tangents. 

If a conic is tangent to the absolute, then the point of tangency is considered as a 
focus, and the tangent at this point as a focal line of the conic. 


A focal line and a focus are dual notions: the pole of a focal line is a focus of the 
polar conic. 

Two complex conjugate lines intersect in a real point, and two complex conjugate 
points lie on a real line. Therefore every conic has at least one pair of (possibly 
coincident) real foci and at least one pair of (possibly coincident) real focal lines. 


Lemma 5.3. A real focus of a hyperbolic conic is a hyperbolic, ideal, or a de Sitter 
point at the same time as it lies inside, on the boundary, or outside of the oval bounded 
by the conic. 


Proof. This is quite obvious, and becomes even more obvious in the dual formulation: 
a real focal line is hyperbolic, ideal, or de Sitter at the same time as it intersects, is 
tangent, or is disjoint from the conic. 


Ellipses and hyperbolas have three pairs of focal lines and three pairs of foci, every 
pair corresponding to different matchings of four absolute points or four absolute 
lines. All foci of concave hyperbolas and de Sitter hyperbolas are real and lie in the 
de Sitter plane. The other ellipses and hyperbolas have only one pair of real foci. 

Non-osculating parabolas have two pairs of foci. One pair is real and contains an 
ideal point. The other pair is double, that is, it splits in two pairs if the parabola is 
perturbed so that to become an ellipse or a hyperbola. The double pair of foci is real 
for the concave parabolas and for the de Sitter parabola. The osculating parabola has 
a triple pair of foci, consisting of the point of tangency and of a de Sitter point on the 
corresponding absolute tangent. 

All cycles have a pair of coincident foci: the center of a circle, the ideal point 
of a horocycle, and the pole of the center line of a hypercycle. Hypercycles have in 
addition a double pair of foci: the endpoints of the center line. 


Lemma 5.4. A conic and the absolute induce the same Cayley—Klein metric on each 
focal line and on the pencil of lines through each focus. 
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Proof. A Cayley—Klein distance between two points is determined by their cross- 
ratio with the two (possibly imaginary) collinear points on the conic. A focal line 
intersects the conic and the absolute in the same points, therefore the two metrics on 
this line coincide. 

Similarly, a Cayley—Klein angle between two lines is determined by the cross- 
ratio of these lines with the two tangents from the same pencil. In the line pencil 
through a focus the same two lines are tangent to the conic and to the absolute. 


Compare this with Theorems 3.12 and 3.14 in the spherical case. 


Lemma 5.4 implies the following property of a focus of a hyperbolic conic: for 
any line through the focus, its absolute pole and the pole with respect to the conic are 
collinear with the focus, see Figure 27, left. The Euclidean analog of this is shown 
on Figure 27, right. 


Figure 27. Characteristic properties of foci of hyperbolic and Euclidean conics 


5.4 Axes and centers 


As we already noted, foci and focal lines come in pairs. 


Definition 5.5. The intersection of a pair of focal lines is called a center of the conic. 
The line through a pair of foci is called an hyperbolic conic!axis of the conic. 
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Figure 28. Centers and axes of a concave hyperbola 


All hyperbolic ellipses and hyperbolas, except the semi-hyperbola, have three 
distinct centers and three distinct axes. The case of a concave hyperbola is illustrated 
on Figure 28. 


Lemma 5.6. 1. The centers of a conic are pairwise conjugate with respect to the 
conic as well as with respect to the absolute. 

2. The axes of a conic are pairwise conjugate with respect to the conic as well as 
with respect to the absolute. 


3. A line through two centers is an axis; an intersection point of two axes is a 
center. 


Proof. It is a classical fact that the three diagonal points of a quadrangle inscribed in 
a conic are pairwise conjugate with respect to the conic, see e.g. [2, Section 14.5.2]. 
The quadrangle of the absolute points is inscribed both in the conic and in the abso- 
lute. This implies the first part of the lemma. 

The second part is dual to the first one. 

Three pairwise conjugate lines or three pairwise conjugate points form a self- 
polar triangle. Since two conics in general position have a unique common self-polar 
triangle, it follows that the centers and the axes span the same triangle. 


The third part of the lemma has the following reformulation: for two conics meet- 
ing in four points, the diagonals of the common inscribed quadrilateral and those of 
the common circumscribed one meet at the same point. An alternative proof of this 
is to apply a projective transformation that sends the four intersection points to the 
vertices of a square. 
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5.5 Directors and directrices 


Definition 5.7. The pole of a focal line with respect to the conic is called a director 
point of the conic. The polar of a focus with respect to the conic is called a directrix 
of the conic. 


Compare Definitions 3.10 and 3.11. 


Figure 29. A pair of director points and a pair of directrices 


Lemma 5.8. Director points of a conic lie on its axes. Directrices of a conic pass 
through its centers. 


Proof. Draw a line through a pair of director points. By the basic properties of po- 
larity, the pole of this line with respect to the conic is the intersection point of the 
corresponding focal lines, see Figure 30, that is, a center of the conic. It follows that 
the line through a pair of director points is an axis. 
The second part of the lemma is dual to the first. 


Figure 30. A pair of director points span an axis 


Corollary 5.9. Through every center of a hyperbolic conic pass two focal lines, two 
axes, and two directrices. 
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5.6 Examples 


Let us locate foci and directrices for some types of hyperbolic conics. We assume 
that Q(v,v) = x* + y? — z?, so that in the Beltrami—Cayley—Klein model in the 
plane z = 1 the absolute is the unit circle centered at the origin. 


Example 5.10. By a linear transformation that preserves the absolute, a hyperbolic 
ellipse can be brought to a canonical form 


x? 7 
mtpalh l>a>b. 


The real foci, real focal lines, and real directrices are 


[a2 — b? [1—b? > [1—-P 
(+ ——'9), x = ta a x =a ae 


a 


Vv 1+a2 


the directrices are hyperbolic lines, for larger b they are de Sitter lines, see Figure 31. 


She 


Figure 31. Foci and directrices of hyperbolic ellipses 


The directrices are tangent to the absolute if and only if b = . For smaller 5 


Example 5.11. A semi-hyperbola can be brought to a canonical form 


The only pair of real foci is 


be b? 
A = (———..0). &# = (———— °). 
1—»V/b4 —ab? +1 1+ Vb* —ab? +1 
The focus F{ is a de Sitter point, the focus F> is a hyperbolic point. The correspond- 
ing directrices d, and dz are given by equations 


b? b? 
———$ $<, 58 |, 
ab? —1+ /b*—ab? +1 ab? —1— /b4 —ab? +1 
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respectively. Fora = a (that is, when the semi-hyperbola is represented by a circu- 
lar arc) both directrices are tangent to the absolute, fora < a the directrix d, is de 
Sitter while dz is hyperbolic, finally for a > a the directrix d, is hyperbolic while 
dz is de Sitter, see Figure 32. 

If a = 0 (the semi-hyperbola is represented by a parabola), then d; = F>, and 
dz = Fy. In this case the semi-hyperbola is the locus of points equidistant from the 
point F; and the line d;, see Example 6.6. 


Figure 32. Foci and directrices of semihyperbolas 


Example 5.12. An elliptic parabola can be brought to a canonical form 


_ 1 ay2 2 
goer b=, b 2a. 
a 


(The condition b? < a ensures that the curve stays inside the unit circle.) The only 
pair of real foci: 


2(a — b”) 


The corresponding directrices are given by the equations 


.0), Fy = (1,0). 


_ p2 
2a(a — b*) \ pei. 


—a + 2b2 — ab? 


The directrix d, is tangent to the absolute for a = 2b, hyperbolic for a > 2b, and 
de Sitter for a < 2b?. 


CICiei 


Figure 33. Foci and directrices of elliptic parabolas 


dy ={x=14 
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5.7 Families of hyperbolic conics 


Since the focal lines of a hyperbolic conic are determined by its intersection points 
with the absolute, the conics that share the focal lines form a pencil of conics contain- 
ing the absolute conic. For ellipses and hyperbolas, this pencil is determined by four 
distinct points (some of which can form complex conjugate pairs). For example, if 
all four points are real, then the pencil is formed by convex and concave hyperbolas 
and by three pairs of focal lines. 


By polarity between foci and focal lines, the confocal conics (those sharing a pair 
of foci) form a dual pencil. There are several types of confocal nets of hyperbolic con- 
ics. The simplest ones are formed by concentric cycles and pencils of lines through 
their centers. Confocal nets formed by ellipses, hyperbolas, or parabolas are shown 
on Figures 34 and 35. 


Figure 35. Confocal hyperbolic parabolas 


Conics that share a focus and a corresponding directrix form a double contact 
pencil, compare Corollary 3.15 in the spherical case. 
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6 Theorems about hyperbolic conics 
6.1 Ivory’s lemma 


Similarly to the Euclidean and to the spherical case (see Theorem 4.1), we have the 
following. 


Theorem 6.1 (Ivory’s lemma). The diagonals in a quadrilateral formed by four con- 
focal hyperbolic conics have equal lengths. 


This theorem is proved in [22] by a method similar to that used in the spherical 
case, see Section 4.1: one finds a homomorphism Fy: R* > R? that is self-adjoint 
with respect to and maps one side of the quadrilateral onto the opposite side. 


6.2 Geometric interpretations of the scalar product 


This section deals with metric properties of hyperbolic conics. In the proofs it will 
be more convenient to use the hyperboloid model of the hyperbolic plane. We have 


{x € R? | Q(, x) = —1} = H? U(-B), 


where H? and —H? are two components of the hyperboloid. The distance between 
two points x, y € H? can be computed by the formula 


Q(x, y) = —cosh(dist(x, y)). 
The hyperboloid of one sheet represents a double cover of the de Sitter plane: 
{x € R3 | Q(x, x) = 1} = d8?. 
As before, a de Sitter point x € ds? is the pole of a hyperbolic line 
x° = {y € H? | Q(x, y) = 0}. 


An advantage of ds? over d $7 is that it allows to give a simple description of hyper- 
bolic half-planes: 


x$ = {y € HH? | Q(x, y) > 0}, x2 ={y € H? | Q(x, y) < 0}. 
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The scalar product of two de Sitter points computes the angle or the distance be- 
tween the corresponding hyperbolic lines, and the scalar product of a hyperbolic and 
a de Sitter point computes the distance between a point and a line. For details, see 
Lemma 6.2 below. 

Finally, there is the isotropic cone, which we divide in two parts 


{x € R? | Q(x, x) = 0} = LU(-L), 


where L and —L are one-sided cones asymptotic to H? and —H7?, respectively. For 
x € L, its polar x° is the plane through x tangent to the isotropic cone, which cor- 
responds to a line tangent to the absolute. But there is a more interesting object that 
one can associate with a point of L. Define 


Hy, = {y € H? | Q(x, y) = —]}. 


Then H, is a horocycle centered at x (it is not hard to show that the normals to Hx 
pass through x). As a curve in R?, Hx is a parabola; the central projection makes it 
an ellipse in the Beltrami—Cayley—Klein model. This ellipse osculates the absolute at 
the image of the point x. The scalar product Q(x, y) for x € H? and y € L measures 
the distance between x and Hy. 

To summarize, every vector in H? U L U ds? C R? corresponds to a geometric 
object in the hyperbolic plane: a point, a (co-oriented) line, or a horocycle; the scalar 
product of two vectors measures the distance between the corresponding objects. An 
exact formulation is given in the lemma below. 


Lemma 6.2. 1. If x, y € H?, then 


Q(x, y) = —coshdist(x, y). 


2. fx € H2 and y € d$2, then 
Q(x, y) = sinh dist(x, y°), 


where the point-to-line distance dist(x, y°) is taken positive if the vectors x and y lie 
on the same side from the plane y° C R°, and negative otherwise. 


3. Ifxe H2 and y € L, then 
Q(x, y) = —eilist(e. Hy) 


where the distance to a horocycle is taken to be negative for the points inside the 
horocycle and positive for the points outside. 
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4. If x,y € dS2, then 


cos(Z(x°, y°)) if x° MN y° is hyperbolic, 
Q(x, y) = 441 if x° 1 y° is ideal, 
+ cosh(dist(x°, y°)) if x° N y° is de Sitter. 


Here Z(x°, y°) is an angle between the lines x° and y° occupied by the points z 
where Q(x, Zz) and Q(y, Zz) have different signs. 


S.Ifx € d$* and y € L, then 
Q(x, y) = eflist(x® Hy) 


where the distance between a line and a horocycle is the length of the common per- 
pendicular taken with the minus sign if the line intersects the horocycle. 
6. If x, y € L, then 
Q(x y) = —2eiist(Hx,Hy) 


where the distance between two horocycles is the length of the common perpendicular 
taken with the minus sign if the horocycles intersect. 


This is proved via an appropriate parametrization of the line spanned by the points 
x and y, see e.g. [20] and [26]. 

As a simple application of the previous lemma, we can describe the quadratic 
cones that correspond to hyperbolic cycles. 


Lemma 6.3. For every c € Rand p € R?, the quadratic cone 
Q(x, x) +¢- Q(x, p)? =0 


corresponds in the hyperbolic-de Sitter plane with the absolute QQ to a cycle centered 
at the point p. Namely, it describes 


a circle with the center p if Q(p, p) < 9, 
a horocycle with the ideal point pif Q(p, p) = 9, 
a hypercycle around the line p° if Q(p, p) > 0. 


Proof. The intersection of this quadratic cone with H? C R? is formed by the points 
x with Q(x, p) = const. Lemma 6.2 implies that these are the points at a constant 
distance from p, if p € H?, or the points at a constant distance from a horocycle 
centered at p (hence also a horocycle), if p € L, and the points at a constant distance 


from the line p°, if p € d§?. 
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Results of the following sections are essentially due to Story [23]. However, he 
does not care about the position of foci, focal lines etc., stating the results in terms of 
Q2(x,-) only. Therefore we needed to elaborate on the geometric meaning. 


6.3 The focus-directrix property 
Recall that in the Euclidean case the equation 
dist(x, F) = € - dist(x, d), 


where F is a point and d is a line not passing through F’,, determines for 0 < € < 1 
an ellipse, for € = 1 a parabola, and for e¢ > 1 a hyperbola with a focus F and 
the corresponding directrix d. The following theorem gives a similar description of 
hyperbolic conics. 


Theorem 6.4. Let S be anon-degenerate hyperbolic conic other than a circle, horo- 
cycle, and hypercycle. Let F be anon-ideal focus of S, and let d be the corresponding 
directrix. Then the conic consists of all points x that satisfy the equation 


6(F,x) = «€-d(d,x) (5) 
for some positive constant €, where 


cosh(dist(x, F°)) if F is a de Sitter point, 


O(F.xy= ) 
sinh(dist(x, F)) if F is a hyperbolic point, 


cosh(dist(x,d°)) if d is ade Sitter line, 
5d(d,x) = ¢ etist(x.Ha) if d is tangent to the absolute, 
| sinh(dist(x,d))| if d is a hyperbolic line. 


We need a preparatory lemma. 


Lemma 6.5. Let p be a point not on a conic Q. Then we have 


O(p, p)Q(x, x) a O(p, x)* ice ty (x) t2(x), 


where t, and tz are the (possibly imaginary) tangents to Q through p. 


Here f(x) ~ g(x) means that f(x) = c - g(x) for some non-zero real c. 
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Proof. The three conics Q(x, x), O(p, x), and f(x) - t2(x) belong to the same 
double contact pencil. Thus we have 


AO(x,x) + wO(p, x)? + vty (x)to(x) = 0. 


For x = p the third summand vanishes, which implies A = —yQ(p, p), and the 
lemma is proved. 


Proof of Theorem 6.4. Since the tangents through F to Q coincide with the tangents 
from F to S, we have by Lemma 6.5 


S(F, F)S(x,x) — S(F, x)? ~ Q(F, F)Q(x, x) — Q(F, x)’, 
and hence 
S(x,x) ~ Q(F, F)Q(x, x) — Q(F, x)? — AS(F, x)? 
for some A € R. On the other hand 
S(F,x) ~ d(x) ~ Q(d°, x), 


where d is the directrix corresponding to F. Hence equation Q(x, x) = 0 is equiv- 
alent to 


QF, FYQ(x, x) — Q(F, x)? = pQ(d®, x)’, 
for some jz € R, which is in turn equivalent to 


Q(F, x)? _ Old x)- 


OF PG OG. 


Lemma 6.2 provides metric interpretations of the expressions on the left and the right 
hand side. 


It can be shown that the constant in equation 5 is equal to 


_ [Q(E, £)- S(F,d°) 
<= VS, E)-Q(F, a)’ 


where E is the center of the conic corresponding to the focus F’, that is, the intersec- 
tion point of the directrix d and the polar F° of F. 

A classification of hyperbolic conics according to the nature of the (focus, direc- 
trix) pair and the value of € seems to be missing in the literature. Below we describe 
some special cases without going into details. 
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Let the focus be a hyperbolic point, and the directrix be both hyperbolic. Then 
the equation 
sinh(dist(x, F)) = €- | sinh(dist(x, d))| 
describes 
for 0 < « < e7 “sta. F) 
—dist(d,F) 


an ellipse, 


fore =e an elliptic parabola, 


for e7 dist(d,F) <e< edist(d, F) 


dist(d,F’) 


a semi-hyperbola, 


fore =e a convex hyperbolic parabola, 


dist(d,F) 


fore >e a convex hyperbola. 


This can be shown by restricting the equation to the line through F’ perpendicular 
to d. 


Example 6.6. Fora point F anda line d not through F the locus of points satisfying 
dist(x, F) = dist(x, d) 


is a semi-hyperbola whose other focus is the pole of d, and the corresponding direc- 
trix is the polar of F’, see Example 5.11. In the Beltrami—Cayley—Klein model in the 


unit disk, for F = (c,0) and d = {x = —c} this semi-hyperbola is described by the 


1—c? . 2 
4c y’. 


equation x = 


If the focus F is hyperbolic, and the directrix d is tangent to the absolute or de 
Sitter, then for small values of € we get ellipses. As € increases, the ellipses transit 
through elliptic parabolas to semihyperbolas, see Examples 5.11 and 5.12. 

If for a semihyperbola we take its de Sitter focus, then changing the value of € 
will transform the semihyperbola into a concave hyperbola, passing through a (long 
or wide) concave hyperbolic parabola or, in an exceptional case, through a horocycle. 


6.4 Bifocal properties of hyperbolic conics 
The following theorem is an analog of Theorem 4.8 about spherical conics. 


Theorem 6.7. 1. Let £1, £2 be a pair of lines in the hyperbolic-de Sitter plane. For 
a point x € HH? let 


sinh(dist(x,£;)) if €; is hyperbolic, 
d(x) = \ cosh(dist(x,€?)) if £; is de Sitter, 


ety) if €; is tangent to the absolute. 
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Here H; is a horocycle centered at the ideal point €?, and the distances from a point 
to a line or to a horocycle are equipped with a sign. Then the locus of points that 
satisfy an equation of the form 


di(x)+do(x) =c (6) 


is a hyperbolic conic, and £1, 2 is a pair of its focal lines. Conversely, for every pair 
of focal lines of a hyperbolic conic, the points on the conic satisfy equation (6). 


2. Let pi, p2 be a pair of points in the hyperbolic-de Sitter plane. For a line & in 
the hyperbolic plane let 


sinh(dist(pi,&)) if pi is hyperbolic, 


cost 
i(&) ct if Dj is ideal, 


where H; is a horocycle centered at p;. The distances are equipped with a sign. If pj 
is a de Sitter point, then for all oriented lines § # p? put 
cos(Z(&, p;)) if € O p? is hyperbolic, 
di(&) = 4 +1 ifE OM p> is ideal, 
+ cosh(dist(&, p?)) if & ON p? is de Sitter. 


The sign convention must be chosen in such a way that d;(&) depends continuously 
on &. Then the envelope of the lines that satisfy an equation of the form 


d\(§)-d2(&) =c (7) 


is a hyperbolic conic, and pj, P2 is its pair of foci. Conversely, for every pair of foci 
of a hyperbolic conic, the tangents to the conic satisfy equation (7). 


Proof. Let us prove the first part of the theorem. 


Lemma 6.2 implies that 


Q(x, £5) - Q(x, £5) 
d -d =), ——_ LE 
1() - da(x) ace 

for some constant A. Therefore equation (6) is equivalent to 


Q(x, x) — £14 (x) - £2(x) = 0, 


where £; are certain linear functionals on R? whose kernels are the lines €;. Thus 
the solution set of (6) is the zero set of a quadratic form O = Q — £15, that is, a 
hyperbolic conic. If x is a (real or imaginary) intersection point of Q and Q, then we 
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have £;(x)-£2(x) = 0, therefore the lines £; and £2 pass through all four intersection 
points of Q and Q2. If some of the intersection points coincide (the conic is a parabola 
or acircle), then either one of the lines is an absolute tangent or both lines pass through 
the tangency point (this can be shown by passing to the limit). Hence £1, ¢2 is a pair 
of focal lines of the conic Q. 

In the opposite direction, let O be a conic, and let £1, £2 be its pair of focal lines. 
Then the conics Q, Q, £1 - £2 belong to a pencil. Therefore, up to scalar factors we 
have 


O0=Q-£1-b. 
Thus for all points x on the conic we have £1 (x) - €2(x) = Q(x, x), which implies 


d(x) - d(x) = c for some constant c. 
The proof of the second part is similar. We have 


Q(E°, pr) - Q(E°, p2) 


d -d =A 
i(&) 2(&) Q(E°, £°) 


Therefore equation (7) is equivalent to 
G2(E, &°) — pi (E*) - palE°) = 0, 
where p;(x) is a linear functional proportional to Q2(p;, x). This equation describes 


a hyperbolic-de Sitter conic; its polar is the envelope of the lines that satisfy equa- 
tion (7). 


Example 6.8. The locus of points that satisfy 
sinh(dist(x, £;)) - sinh(dist(x, £2)) = c, 
where £1, £2 are two hyperbolic lines, is 


* aconcave or convex hyperbola or a pair of lines, if the intersection point of f; 
and f2 is non-ideal; 


* aconcave or convex hyperbolic parabola, if the intersection point of f; and fo 
is ideal. 


Example 6.9. The locus of points that satisfy dist(x, H,) + dist(x, H2) = c, where 
H,, Hz are two horocycles, is a hypercycle whose ideal points are the centers of H1 
and H>. The envelope of the lines that satisfy dist(€, H1) + dist(€, Hz) = c is also 
a hypercycle. 
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Theorem 6.10. Let F1, Fz be a pair of foci of a hyperbolic conic. Then the conic 
consists of all points x that satisfy an equation of the form 


1d, (x) + 62(x)| = const. or |d4(x) — d2(x)| = const. 


Here the functions 5;(x) are the distances to the foci or to their polars or to the 
corresponding horocycles: 
dist(x, F;) if Fj is hyperbolic, 
6j(x) = 4 dist(x, F°) if F; is de Sitter, 
dist(x, H;) if F; is ideal, 


where H; is an arbitrary horocycle centered at F;. 


Proof. Let F be a focus of the conic. First assume that F’ does not belong to the 
conic (and hence does not belong to the absolute). Denote by 11, f2 the two absolute 
tangents through F (which are both real or complex conjugate to each other). By 
Lemma 6.5 we have 


O(F, F)Q(x, x) — OF, x)? ~ (x) - a(x) ~ Q(F, FYQ(x, x) — Q(F, x)’, 


which implies 


Q(F, x)? QF, x) 
’ apa ee Ar . (2 ’ = ao) 
mane CED 9 OG. F) 
for some A € R. To compute A, substitute x = E, the center conjugate to the 


axis through F’. Since the directrix corresponding to F also goes through E (see 
Lemma 5.8), we have O(F, E) = 0, so that A = se. 
With the above argument applied to a pair of foci F), F2 (of which we assume 


that both do not lie on the conic), we obtain 


Q(R.x)? — , Qh, x)? 


O(, x) — AG, 2) = CY Ao ey 


fori = 1,2. 


Introduce the linear functions £ (x), £2(x): 
Q(F;, x) 


V Q(F;, Fi) 


Q(F;, x) 


V-Q(F;, Fi) 


if F; is de Sitter, 
{i(x) = 
if F; is hyperbolic. 
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Since the directrices O(F;, x) = O and the polars Q(F;, x) = 0 of the foci meet at 
the center E, each of the linear functions O(F;, x) is a linear combination of £) (x) 
and £2(x). Taking into account that O(F;, F;) has the same sign as Q(F;, Fj), we 
obtain 


0-12 (ail + aizl2)? —AL? _ iff F; is de Sitter, 
—(aj1l1 + ajrl2)? + AL? if F; is hyperbolic. 


We now make a case distinction. 


1) Both foci are de Sitter. We have 
(aly + blo)? —Al2 = OQ-AQN = (chy + dha)? — 222. 
Solving this we obtain 4 = a” — b?. Denoting ps = & we see that 
O ~ (uw? -1)Q + 24+ 3 + wei bo. 
Since by Lemma 6.2 
€i(x) 


Vy —Q(x, x) 


equation Q(x, x) = 0 is equivalent to 


= sinh(dist(x, F°)) =: sinh(d;), 


(1 — px?) + sinh? (5; (x)) + sinh? (52(x)) + 2 sinh(51(x)) sinh(52(x)) = 0, 
which factors as 
(cosh(61 (x) + 62(x)) — 4)(cosh(5; (x) — 52(x)) + “) = 0. 


If jz > O, then the conic is described by |6;(x) + 62(x)| = c; if w < 0, then it 
is described by |6) (x) — 62(x)| = c. 


2 


wm 


Both foci are hyperbolic. In this case we have 
—(al, + bla)? +A@Z = O-IAQN = —(cb, + dh)? +A“, 


which again results in 4 = a? — b?, so that 


Q aw Ge 1)Q fel i Quy lo. 
Now by Lemma 6.2 we have 


€; (x) 


Vy —Q(x, x) 


= — cosh(dist(x, F;)) =: —cosh(6;). 
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Equation O(x, x) = 0 is equivalent to 
pu? — 1 + cosh?(8,) + cosh?(52) + 21 cosh(61) cosh(S2) = 0, 
which factors as 
(cosh(d; + 62) + t4)(cosh(d; — 62) + pz) = 0. 


If 4 > O, then the conic is empty (in fact, it is the de Sitter ellipse). If 4 < 0 
and dist(F,, F2) < arcosh(— 1), then the second factor never vanishes, and the 
conic is described by 6; + 62 = arcosh(—j). If uw < 0 and dist(Fi, F2) > 
arcosh(—/1), then the first factor does not vanish, and the conic is described by 
[51 — 62| = arcosh(—j). 


3 


wm 


Focus F’, is de Sitter, focus F is hyperbolic. We have 
(al, + bl.)? —AZ? = O-AQ = (cl, + dha)? +A. 
This implies A = a? + b?, and we obtain 
O~ (+ y7)Q—O 4+ & + 2b. 
This time we have 
£1 (x) 
(=2.) 


Equation O(x, x) = 0 is equivalent to 


f(x) 


V¥ —Q(x, x) 


= sinh(dist(x, F;)), = —cosh(dist(x, F2)). 


1+ py? + sinh? (6,) — cosh? (82) + 2 sinh(5,) cosh(6) = 0, 
which factors as 
(sinh(6, + 62) + y2)(sinh(d; — 62) + yr) = 0. 


As in the previous case, only one of the factors can vanish (depending on the 
relation between ju and dist(F2, F?)), and the conic is described by one of the 
equations 5; + 62 = const. or 6; — 52 = const. 


It remains to deal with the case when one or both foci are ideal. If a focus F is 
ideal, then we claim that 


O(x, x) —AQ(x, x) = Q(F, x) -m(x), 
where A = Ser as before, and m(x) is some linear function. Indeed, the choice 
of A ensures that the conic Q — AQ goes through the point E. This point belongs to 
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the line Q(F, x) = 0, which is a common tangent of Q and Q at the point F, and 
hence is also a tangent of OQ — AQ at F. It follows that this line is contained in the 
conic Q — AQ, and the statement is proved. 

Now, assuming that in a pair of foci (F;, F) the first one is de Sitter, and the 
second one is ideal, denote £2(x) = Q(F2, x). We have 


(al, + be2)? —A€?2 = Q-AQ = ka-m. 
It follows that A = a”, so that 
O ~ w?Q 4+ eile + £3, 
where 4 = ¢. By Lemma 6.2, we have 


£1(x) =i, <2 2a ge 


V —Q(x, x) Vf —Q2(x, x) 


hence the equation O(x, x) = 0 is equivalent to 
e252 _ pets ob pera = ue = 0, 


which factors as 


(e781 + y)(e%81 — yw) = 0. 


Depending on whether yu is positive or negative, the conic is described by one of the 
equations 6; + 52 = const. or 6; — 52 = const. 

The case of an ideal and a hyperbolic focus is similar. If both foci are ideal, then 
we have 


0 -AQ=aly- bo, 


which implies that O(x, x) = 0 is equivalent to 6; + 62 = const. 


Similarly to the spherical case (Theorem 4.6), Theorem 6.10 has a dual that deals 
with the angles that a tangent to the conic makes with a pair of focal lines (for ul- 
traparallel lines, the angle becomes the length of a common perpendicular). We do 
not list all the possibilities here, but here is one interesting particular case. Take two 
rays in the hyperbolic plane starting at the same point; then the segments with the 
endpoints on these rays that bound a triangle of constant area envelope a branch of a 
convex hyperbola, see Figure 36. 
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Figure 36. A pair of focal lines and a tangent to a convex hyperbola bound a triangle of constant area 


A special case of Theorem 6.10 with F, hyperbolic, Fy de Sitter, and zero differ- 
ence of distances: 


dist(x, F,) — dist(x, F;) = 0 


is described in Example 6.6. 

Theorem 6.10 implies that at every point of the conic the tangent to the conic 
forms equal angles with the gradients of the functions 5, and 62. From this we obtain 
an optical property of the foci, similar to the Euclidean one. 


Theorem 6.11. Let (F), Fy) be a pair of foci of a hyperbolic conic. Then every light 
ray originating from F\ reflects from the conic in such a way that it either passes 
through F» or continues a ray originating from F». 


The rays originating at an ideal or de Sitter point can be defined in two ways: 
either as half-lines in a projective model of the hyperbolic-de Sitter plane or as the 
rays issued by points on a horocycle or on a line in directions orthogonal to that 
horocycle or a line. Figure 37 shows two examples. 

A statement dual to Theorem 6.11 says that every segment tangent to the conic and 
having endpoints on a pair of focal lines is bisected by the point of tangency. Similar 
to the Euclidean and the spherical case, there is a generalization of Theorem 6.11 and 
of its dual, see Theorem 4.5. All these results are proved in [23] in a direct way, but 
the arguments are quite intricate. 

Reflective properties of some hyperbolic conics were described in [19] within the 
Poincaré model of the hyperbolic plane. 
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Figure 37. Optical properties of elliptic parabolas and semi-hyperbolas 
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1 Introduction 


Since the end of the 19" century, and after the works of F. Klein and H. Poincaré, it is 
well known that models of elliptic geometry! and hyperbolic geometry can be given 
using projective geometry, and that Euclidean geometry can be seen as a “limit” of 
both geometries. (We refer to [1] and [2] for historical aspects.) Then, all the geome- 
tries that can be obtained in this way (roughly speaking by defining an “absolute,” 
which is the projective quotient of the isotropic cone of a quadratic form) were clas- 
sified, see [55]. Some of these geometries had a rich development, most remarkably 
after the work of W. P. Thurston at the end of the 1970s, see [65] and [66], which gave 
rise to a highly prosperous understanding of three-dimensional hyperbolic geometry 


' The elliptic space is the standard round sphere quotiented by the antipodal map. 
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(see among others [11], [13], [67], and [4]). On the other hand, the seminal work 
of G. Mess ([46], see also [6]) of 1990 motivated the study in dimension (2 + 1) 
of the Lorentzian geometries of Minkowski, de Sitter and anti-de Sitter spaces (these 
spaces are known under different names in the realm of projective geometries, see [55, 
p. 375] or [64]), which in higher dimensions have attracted interest for a long time 
in mathematical physics and more precisely in General Relativity, see [49], [10], 
and [36]. These geometries are close relatives of hyperbolic geometry and in fact 
Mess outlined their strong relation with Teichmiiller theory, thus giving rise to new 
directions of development which are still very active [6], [15], [17], [16], [8], [9], 
[19], [32], [62], [61], [22], and [18]. 

Moreover, some degenerate spaces appear naturally in the picture, namely co- 
Euclidean space (the space of hyperplanes of Euclidean space), and co-Minkowski 
space (that we will restrict to the space of space-like hyperplanes of Minkowski 
space), first because of duality reasons, and second because they appear as limits of 
degeneration of classical spaces. In fact, co-Minkowski space recently reacquired in- 
terest under the name half-pipe geometry, since the work of J. Danciger [26] and [27], 
which is very related to the idea of geometric transition [37], [25], [42], [28], [29], 
[20], and [21]. 

The purpose of this paper is to provide a survey of the properties of these spaces, 
especially in dimensions 2 and 3, from the point of view of projective geometry. Even 
with this perspective, the paper does not aim to be an exhaustive treatment. Instead it 
is focused on the aspects which concern convex subsets and their duality, degenera- 
tion of geometries and some properties of surfaces in three-dimensional spaces. The 
presentation is intended to be elementary, hence containing no proofs of deep theo- 
rems, but trying to proceed by accessible observations and elementary proofs. Apart 
from some constructions in Sections 5 and 6, which have been obtained in [60], there 
is no claim of originality in the presented results. On the other hand, in this paper we 
attempt to use a modern mathematical language, thus possibly contrasting with the 
point of view of several presentations of classical topics in the literature. 

Hopefully, this survey will provide a unified introduction to the aforementioned 
geometries, and at the same time it might fill to some extent the lack of references 
that, in the opinion of the authors, surrounds the differential-geometric understanding 
of Minkowski, de Sitter and anti-de Sitter geometries, quite differently, for instance, 
from the case of hyperbolic geometry where a large number of textbooks appeared 
since its modern development. 
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2 Model spaces 


2.1 Pseudo-spheres and model spaces 


The set of unit vectors of three-dimensional Euclidean space, endowed with the met- 
ric induced by the ambient space, is a useful model for the round sphere S?. Indeed 
in this model, 


* at a point x € S?, the outward unit normal vector is the vector represented by 
x itself; 


* the isometries of S? are the restrictions of transformations in O(3), namely of 
linear isometries of Euclidean space; 


* the geodesics are the intersections of S? with linear planes, hence are great 
circles.” 


Therefore, using this model, it is easy to show that 
¢ S? is a smooth surface; 


* the isometry group of S? acts transitively on points and on orthonormal frames 
in the tangent spaces; 


* the sectional curvature is | at every point. 


The above properties still hold for the n-dimensional sphere 5”, the set of unit 
vectors of the Euclidean space of dimension (1 + 1), which is a smooth hypersurface 
of constant sectional curvature if endowed with the induced metric. Moreover, it is 
a simple but fundamental remark that the above properties do not use the fact that 
the usual scalar product on the ambient space is positive definite, but only that it is a 
symmetric non-degenerate bilinear form. 

More precisely, if a smooth manifold M is endowed with a smooth (0, 2)-tensor 
which is a non-degenerate symmetric bilinear form, then there is an associated con- 
nection, the Levi-Civita connection,* and the notions of curvature tensor and geodesic 
only depend on the connection.4 For example, in R”*! endowed with any non- 
degenerate symmetric bilinear form b, the Levi-Civita connection is given by the 
usual differentiation in each component, the curvature is zero, and the geodesics are 
affine lines, regardless of the signature of b. 

"2 This follows by contradiction using the symmetry by reflection in the plane and the fact that an 
unparametrized geodesic from a given point is locally uniquely determined by its tangent vector. 

3 The notion of Levi-Civita connection is recalled in Section 5. 


4 Of course, if the tensor is not positive definite, it does not induce a distance on M, and in particular 
there is no notion of “distance minimization” for the geodesics. 
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The isotropic cone (or null cone) of b is the set of isotropic vectors for b: 
J(b) = {x € R"*! | d(x, x) = O}. 
Definition 2.1. A non-empty subset M of R”*! is a pseudo-sphere if 
M = {x €R"*!| b,x) =1} or M={x € R"* | b(x,x) =I}. 
for a non-degenerate symmetric bilinear form D. 


Remark 2.2. Of course, by replacing b by —5, taking 1 or —1 in the definitions leaves 
the topology unchanged, but produces changes at a metric level. More concretely, an 
anti-isometry between two pseudo-Riemannian spaces* (Nj, gi) and (No, g2) isa 
diffeomorphism f: N, —> N2 such that f* g. = —g,. For example, if b’ = —b, the 
pseudo-sphere 

M = {x | b(x, x) = 1}, 


endowed with the metric induced from J, is anti-isometric to 
N = {x | b’(x,x) = —1}, 


when N is endowed with the metric induced from b’, the anti-isometry being the 
identity. As sets, M and N coincide, but if the signature of b is (p,q), the induced 
metric on M has signature (p — 1, q), while the induced metric on N has signature 
(q, p — 1). In the following, given M, the pseudo-sphere N obtained as above will 
be denoted by M. 


Let Ml be a pseudo-sphere. The following facts can be checked similarly to the 
corresponding properties of the unit sphere in Euclidean space: 


* ata point x € M, a unit normal vector is the vector represented by x itself, 


* the restrictions of the transformations of O(p, q), namely the linear isometries 
of b, are isometries® of M, 


* totally geodesic submanifolds are the intersections of JM with vector subspaces. 


5 A smooth manifold is pseudo-Riemannian if it is endowed with a non-degenerate smooth (0, 2)- 
tensor. If the tensor is positive definite at each point, then the manifold is Riemannian. If the tensor has 
a negative direction, but no more than one linearly independent negative direction at each point, then 
the manifold is Lorentzian. 

6 The isometry group of M is bigger than O(p, g) if M is not connected. 
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It follows straightforwardly from the above properties that 


¢ M is a smooth hypersurface, 


the isometry group of M acts transitively on points and on orthonormal frames 
in the tangent spaces, 


the sectional curvature is constantly equal to 1 if 0 = b~!(1) and equal to —1 


if? M = b-1(-1) 


For n > 2, the manifold b~!(1) is diffeomorphic to $?~! x R4, and b~!(—1) is 
diffeomorphic to S4~! x R?, so it may be disconnected. As for the remainder of this 
section, the reference is [49] for more details. But from the definition, if x € JM, then 
also —x € M, hence we will be mainly interested in the projective quotient of M. 
As the action of the antipodal map is clearly isometric, the quotient will inherit in an 
obvious way many of the properties of . 


Definition 2.3. A model space M is a space M := M/{+ Id}, where M is a pseudo- 
sphere, together with the induced metric from b|>.. 


It is obvious that the projective quotient of M is anti-isometric to IM, and will 
thus be denoted by IM. Moreover, in the case where M is not connected, it has two 
connected components, which correspond under the antipodal map x t+ —x. In 
particular, IM is connected. 

By construction, IM is a subset of the projective space R P”. Indeed, topologically 
IM can be defined as 


IM = P{x € R"*! | b(x, x) > 0}, 


or the same definition with > replaced by <, depending on the case. Hence, any 
choice of an affine hyperplane in R”*! which does not contain the origin will give 
an affine chart of the projective space, and the image of IM in the affine chart will be 
an open subset of an affine space of dimension n. 

Recall that PGL(7 + 1, R), the group of projective transformations (or homogra- 
phies), is the quotient of GL(n + 1, R) by the non-zero scalar transformations. Even 
if is not connected, an isometry of M passes to the quotient only if it is an element 
of O(p, q). Hence Isom(IM), the isometry group of IM, is PO(p, q), the quotient of 
O(p, q) by {+ Id} (indeed elements of O(p, g) have determinant equal to 1 or —1). 


7 This follows from the Gauss formula and the fact that the shape operator on M is + the identity. 
The fact that the sectional curvature is constant also follows from the preceding item. 
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2.2 Lines and pseudo-distance 


Pseudo-distance on ll. A geodesic c of M is the non-empty intersection of M 
with a linear plane. This intersection might not be connected. The geodesic c of M 
is said to be 


¢ space-like if b(c,¢) > 0, 
* light-like (or null) if b(¢,¢) = 0, 
* time-like if b(¢,¢) < 0. 


An anti-isometry sends space-like lines onto time-like lines. For example, assume 
that the restriction of b to a plane P of coordinates (x1, x2) has the form ae + eee 
Then the curve 

{x € P | b(x, x) = 1} = {xf + x3 = 1} 
is a space-like geodesic. On the other hand, b’ = —b has the form —x? — x3 and thus 
{x € P| b'(x,x) =—-1} = (x7 +33 = 1} 


is time-like. Actually, up to a global isometry of the ambient space, a non-light-like 
geodesic of M can be written in one of the following forms: 


(al) {x € P | b(x,x) = 1}, where b| p(x, x) = x7 + x3; 
(a2) {x € P | b(x,x) = —1}, where b|p(x, x) = —x? — x2; 


(a3) {x € P | b(x,x) = 1}, where b| p(x, x) = x? — x3; 


(a4) {x € P | b(x,x) = —1}, where b|p(x, x) = x? — x3. 


Up to a change of coordinates, a non-light-like geodesic is a circle or a branch 
of hyperbola. If the geodesic is light-like, it is an affine line in the ambient space. 
It follows from these descriptions that: 


Fact 2.4. Every pseudo-sphere M is geodesically complete: each maximal (for the 
inclusion) geodesic is defined on the entire real line. 


Let x, y be on aconnected component of a geodesic c of M, and let us parametrize 
by y: [0, 1] + c a subsegment of c such that y(0) = x and y(1) = y. The pseudo- 
distance is 


1 
it. y) = / Jbow. rola: (1) 
0 
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If x and y lie on a circle, there are actually two choices for the subsegment of c 
joining them. We choose the one which gives the smaller distance. In particular, in 
this case d (x, y) € [0, x]. Moreover, if c is light-like, then d(x, y) = 0. We have 
d(x, x) = Oand d(x, y= d(y, x), but in general d is not a distance.® 

Considering the cases in (al)—(a4), explicit parameterizations of the circle or the 
hyperbola give: 


(b1) B(x, y) = cos(d(x, y)); 
(b2) —b(x, y) = cos(d(x, y)); 
(b3) b(x, y) = cosh(d(x, y)); 
(b4) —b(x, y) = cosh(d(x, y)). 


Projective distance on IM. We are mainly interested in non-parametrized geodesics 
of M. 


Definition 2.5. A line (resp. plane, hyperplane) of M is the projective quotient of 
a non-empty intersection of JM with a 2-dimensional (resp. 3-dimensional, n-dimen- 
sional) vector space. 


A straightforward property is that the image of a line (resp. plane, hyperplane) of 
IM in an affine chart is the intersection of the image of IM with an affine line (resp. 
plane, hyperplane) of IR”. 

We say that a line of IM is space-like (respectively time-like, light-like) if its lift is a 
space-like (resp. time-like, light-like) geodesic of MM. But also lines can be classified 
with respect to the quadric given by the projective quotient of the isotropic cone. 


Definition 2.6. The projective quotient P J(b) of the isotropic cone is called the ab- 
solute.° 


Definition 2.7. A line of IM is called 
¢ hyperbolic if it intersects the absolute in two distinct points, 
* parabolic if it intersects the absolute in one point, 


* elliptic it it does not intersect the absolute. 


8 It is a distance when M is Riemannian. 

° The absolute corresponds to what is sometimes called the boundary at infinity of IM. We will avoid 
this terminology in the present paper, because some confusion may occur from the fact that in an affine 
chart, the boundary at infinity may be at infinity or not. Note that an absolute is often the boundary at 
infinity of two model spaces. 
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Lemma 2.8. Parabolic and light-like lines coincide. 


Proof. A parabolic line is the projective quotient of a plane P containing a unique line 
belonging to the isotropic cone of b. But a light-like geodesic of is the intersection 
of M with a plane on which the restriction of b is degenerate. 


Recall that an anti-isometry exchanges space-like and time-like lines, but it pre- 
serves the type of a line given by Definition 2.7. Actually the lines in IM whose lifts 
are described in (al )-(a4) have the following corresponding type: 


(cl) space-like elliptic, 
(c2) time-like elliptic, 
(c3) time-like hyperbolic, 
(c4) space-like hyperbolic. 


If x, y are on a line of M, we will define a “distance” between them, which 
roughly speaking corresponds to the pseudo-distance between the corresponding lifts. 
But one has to choose the lifts carefully. 


Definition 2.9. Let x, y be ona line / of M. The projective distance d(x, y) is 
* Oif / is parabolic, 


od (x, y) if / is hyperbolic, and x, y are lifts of x and y on the same branch of 
hyperbola, 


od (x, y) if 7 is elliptic, and x, y are lifts of x and y such that d (x, ¥) is the 
smallest. !° 


With the help of the cross-ratio, one can give a more direct definition of the 
projective distance. Recall that if four points x, y,q, p are given on the projective 
line RP! & RU {oo}, then there exists a unique projective transformation / with 
h(x) = oo, h(y) = 0 and h(q) = 1. Then the cross-ratio can be defined as 


[x,y.q, p] = h(p) € RU {oo}. 


Of course, [x, y,q, p] = coif p = x, [x, y,q, p] = Oif p = y and [x, y,q, p] = 1 
if p = q. More explicitly, a formula is 
X-—G)—P 
[x, ¥.q, p] = ————.. 
Y-GxX—Pp 
Note that this last definition extends to complex numbers, i.e. the definition of the 
cross-ratio extends to C P!. 


10 Tn particular, in this case d(x, y) € [0, Zz). 
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Lemma 2.10. Given two distinct points x, y € M ona line |, then | intersects the 
absolute at two points I,J (may be not distinct, may be in C P!), and 


1 
d(x,y) = inte vst} (2) 


Here \n denotes the branch of the complex logarithm \n: C \ {0} > {z € C:Im(z) € 


(—z, m]}. 


Proof. If I is parabolic, then I = J, [x, y, I, J] = 1 and the result follows. 

Let us consider the case when / is elliptic. A lift of the line containing x and y isa 
connected component of the intersection of a 2-plane P with M. The restriction of the 
bilinear form b to the plane P has necessarily signature (+, +) or (—, —), otherwise 
P would meet the isotropic cone. Without loss of generality, let us consider the case 
(+, +), and let us introduce coordinates such that 


b(x, x) = x? + x2 


andc = PAM = {x € P | b(x,x) = 1}. In the complex plane, the equation 
x? + x3 = 0 is solved by the two complex lines! 


-f]=-[4 


For more about the points I, J see e.g. [53]. 

Let us parametrize c as (cos f, sinf), and let us choose tf, and ty such that d(x, y) = 
|t; — ty|. In the affine chart x. = 1, the line I (resp. J, x, y) is represented by the 
point i (resp. —i, cotan t,, cotan ty). Then we compute 


1 1 ie ad 
5 nlx. y LJ] 7 In[eotan fx, cotan ty, 7, —i] 


(: cotan fy + 1 i cotanty — ~) 
n eS ee 
icotanty + licotant, —1 


1 
2 
1 
P r) 


icotant, + | 1 icotanty + 1 
* icauat, <1)” 2" recy a 


i cotant, — 1 i cotanty — 1 


arcoth(i cotan ¢) — arcoth(i cotan ty) 


i =te 


i(t, — ty), 


'! In the text, we will use brackets to designate a projective equivalence class. 


330 F Fillastre and A. Seppi 


where the last equation follows from cos(x) = cosh(ix), sin(x) = (+) sinh(ix), 
therefore i cotan x = arcoth(ix). This shows that, if / is elliptic, then the expression 
inside the modulus of (2) is purely imaginary. Taking the modulus we obtain 


1 
inte y, I, i = d(x, y). 


If / is hyperbolic, it intersects the absolute in two distinct points I, J, such that 
I and J do not separate x and y. Similarly to the previous case, one can assume 
b,x) =x? —xi on P nde = POM = {x € P | b(x,x) = —1}. Picking 
the lifts of x and y on the same branch of the hyperbola of c, which is parameterized 
by (cosht, sinht), one sees that in the affine line x2 = | the line I (resp. J, x, y) 
is represented by the point —1 (resp. 1, tanh¢,, tanht,). Hence by an analogous 
computation, 


1 1 
3 nlx, y,LJj = 3 In[tanh ¢,, tanh ty, —1, 1] 


1 (aes + -) A (as + -) 
— n —_—_—_ _— ———_ 
2 tanh t, — 1 2 tanh ty — 1 


= ty — ty. 


In this case [x, y, I,J] is a real number, and the formula holds in a way analogous to 
the elliptic case. 


2.3 Usual model spaces 


We will often consider the symmetric bilinear form b on R”*! in a standard form, 
i.e. such that the standard basis of R”*! is an orthonormal basis for b, and if the 
signature of b is (p,q), then the quadratic form is positive on the first p vectors of 
the standard basis, and negative on the other g vectors. In this case, we will denote 
the bilinear form by pq. 


Euclidean and Elliptic spaces. In our notation R”*! is the real coordinate space 
of dimension (n + 1), and we will use the notation E”*! for the Euclidean space 
of dimension (n + 1), that is, R"*! endowed with b,+1,9. The elliptic space Ell" 
is the projective space of dimension n endowed with the spherical metric inherited 
from 8”, namely 


Ell” = {x € R"*? | byy10(%) = 1/{+ Id}. 
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It is a compact Riemannian manifold of dimension 1. Any affine chart corresponds 
to a central projection of the sphere onto a hyperplane. All its lines are space-like 
elliptic, in particular the distance induced by the Riemannian structure is given by 
the projective distance (2). 


Minkowski, Hyperbolic and de Sitter spaces. The space R”+! endowed with 
bn,1 is called the Minkowski space and denoted by Min”* '_ An affine hyperplane Q 
of direction P in Min”*! is called 


* space-like if (bn,1)|p is positive definite; 
* light-like or null if (bn,1)|p is degenerate; 
° time-like if (by,1)|p has signature (n — 1, 1). 


(See Figure | for the n = 2 case.) 


Figure 1. A plane in Min? is space-like (resp. time-like, light like) if its intersection with the 
isotropic cone of b2,1 is reduced to a point (resp. two lines, a single line) 


The hyperbolic space is 
H”? = H”/{+ Id}, 


with 
H? = {x € RR” | ba (x) =—1}, 


endowed with the induced metric. The hyperbolic space is a simply connected com- 
plete Riemannian surface of sectional curvature —1. All its line are space-like hyper- 
bolic, in particular the distance induced by the Riemannian structure is given by the 
projective distance (2). 


332 F Fillastre and A. Seppi 
The other pseudo-sphere of Minkowski space is the de Sitter space 
d$” = {x E R"*! | bai (x) = 1}/{+Id} 


endowed with the induced metric. On any tangent plane to d$”, the restriction of 
bn,1 has Lorentzian signature, i.e. it is non-degenerate with signature (n — 1, 1). De 
Sitter space has constant curvature 1, and is homeomorphic to 8”~! x R. 

Let us focus our attention to the case n = 2. The de Sitter plane d$? has space- 
like elliptic, null and time-like hyperbolic lines. This is more easily seen by taking 
affine models. The most common model is {x3 = 1}. Then H? is the unit open disc, 
and d$? is the complement of the closed disc in the plane, see Figure 2. !? 


space-like elliptic 2 
iii! Ell* ~ 
null parabolic 


ds$3 
time-like 


ds” 


space-like hyperbolic 


EP” 


Figure 2. On the left: types of lines in an affine model of the hyperbolic and de Sitter planes. On the 
right: types of planes in an affine model of the hyperbolic and de Sitter spaces. 


By any affine transformation, any ellipse instead of the circle gives another affine 
model. (These affine transformations are given by the choice of any space-like affine 
plane in Min? which does not contain the origin.) Taking as affine chart a time-like 
or a light-like affine plane of Mim?, we obtain other models of the hyperbolic and 
de Sitter planes: 


Fact 2.11. The convex side of any conic of the plane (i.e. a parabola, ellipse or a 
hyperbola) endowed with the projective distance is an affine model of HH? or H2. The 
other side is an affine model of dS? or 82. 


!2 This is the famous Klein model for the hyperbolic plane. 
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In dimension 3, we obtain: 


Fact 2.12. Any convex side of a non-degenerate quadric of R? (i.e. an ellipsoid, 
an elliptic paraboloid or a two-sheeted hyperboloid) endowed with the projective 
distance is a model for 1? or 13. The other side is a model for dS or d$3. 


It is then immediate that a plane of H? is a hyperbolic plane, and that a plane of 
dS? may be isometric to d$?, to the elliptic plane Ell’, or to the co-Euclidean plane 
* Th? (see 3.5), see Figure 2. 


Anti-de Sitter space. By considering the non-degenerate symmetric bilinear form 
bn—1,2 on R”*1. for n > 2, we define the anti-de Sitter space as 


AdS” = Ad8" /{+Id}, 
with 
AdS" ={x eR" | bp 0(%) = —1} 
endowed with the metric induced from b,_1,2. Anti-de Sitter space is a Lorentzian 


manifold of curvature —1. It is not simply connected as it contains a closed time-like 
geodesic. 


Remark 2.13. It is readily seen that Ad? is anti-isometric to d$*. This is not the 
case for n > 2. 


Let us now focus on the n = 3 case. First note that Ad 8? is anti-isometric to 
by. }(1). Up to affine transformations, A.dS? has two models described as follows: 


Fact 2.14. Any part of R? delimited by a double-ruled quadric (i.e. a hyperbolic 


paraboloid or a one-sheeted hyperboloid) endowed with the projective distance is a 
model of AdS? or Ad$3. 


A plane in AdS? can be isometric to H?, to the co-Minkowski plane * Min? 
(cf. Section 3.5), or to Ad$?, see Figure 3. 


2.4 Convex sets and duality 


Dual cones. A non-empty subset K of R”*! is convex if it contains the segment 
between any two of its points (note that this notion only uses the affine structure of 
R"*!), A convex cone @ of R” is a convex set such that AC C @ for any A > 0. 


334 F Fillastre and A. Seppi 


Figure 3. On the left-hand side, some non-affinely equivalent lines in an affine model of Ad$7. First 
row on the left: space-like hyperbolic, second row: light-like parabolic, third row: time-like elliptic. 
On the right-hand side, some non-affinely equivalent planes in an affine model of Ad$?. First row: 
hyperbolic, second row: co-Minkowski, third row: Ad$?. 


We will also suppose that C is pointed: the only linear subspace it contains is {0}. In 
general, we will also assume implicitly that the cone is closed. 

The dual (R”*!)* of R”*? is the set of linear forms on R”*!, and is naturally 
endowed with a vector space structure of dimension (7 + 1). Note that the notion of 
convexity also holds in (IR’*1)*. Given a convex cone @ in R”, its dual is 


C* = {x* € (R”)* | x*(y) <0, forall y € Ch. (3) 


It is readily seen that the dual of a convex cone is a convex cone, and that 
ACB => B*c A’. (4) 


Recall that a support space of a closed convex set K is a half-space containing K 
and bounded by an affine hyperplane H. If moreover KM H 4 Q, H is a support 
plane of K. A convex set is also the intersection of its support spaces. If K = C isa 
convex cone, then its support planes are linear hyperplanes. Now if x* is a non trivial 
linear form on R”*!, then its kernel is a linear hyperplane, and {y | x*(y) < O}isa 
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half-space bounded by this hyperplane. So C* can be interpreted as the set of support 
planes of ©. As a convex set is the intersection of its support spaces, 


C = {x € R” | y*(x) <0, forall y* € C*}. (5) 
From the formal duality between the definitions (3) and (5), it is readily seen that 
(C*)* = €, (6) 


If R”*! is endowed with a non-degenerate symmetric bilinear form b, then b 
induces an isomorphism between R”*! and (IR”*!)*. Once b, and thus the isomor- 
phism, are fixed, we will still denote by @* the image of @* in R"*!: 


C* = {x € R” | b(x, y) < 0, forall y € C}. 


See Figure 4 for an example with b = b;,;. Let us denote by C the projective quotient 
of AC, for all A € R, ie. 
C= (CU —€) = Id}. 


Remark 2.15. We could have defined as a dual for © the convex set 
e* = {x € R” | b(x, y) = 0, forall y € C}, 


as both definitions agree in the projective quotient: C* = C*’. 


Figure 4. Dual cones in the Minkowski plane 


Convex sets in model spaces. Let J be a pseudo-sphere, © be a convex cone, and 
suppose that JM C is non-empty. Recall that IM is the projective quotient of M, that 
is, M:= M/{+ Id}. 


Definition 2.16. A convex set of IM is the intersection MN C. 
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The image of a convex cone in an affine chart may not be an affine convex set. 
For example the intersection of {z? < x? + y?} with a vertical plane. But the image 
of a convex cone in an affine chart is an affine convex set if the vector hyperplane 
parallel to the hyperplane which defines the affine chart meets the cone only at {0}. 
Moreover in an affine chart, even if C is convex, M M C is not necessarily an affine 
convex set (the image of IM in the affine chart may be non convex). 

Also, a convex set of IM is not necessarily a geodesically convex subset of IM. 
For example, take any convex cone in R? which contains the isotropic cone of b2,1 
in its interior, and is on one side of the horizontal plane. In the affine chart given by 
{x3 = 1}, this gives a convex set of dS”, which is an affine convex set. But it contains 
the absolute in its interior, so it is not a geodesically convex set of dS? (see the big 
triangle in the left-hand side of Figure 5). 


Figure 5. Triangles in dS? dual to triangles in H? in an affine chart. Note that on the right-hand side, 
the dual triangle is not the convex hull in the affine chart of the vertices. 


Duality. The notion of duality for convex sets follows easily from the one for con- 
vex cones. Let us focus on the more relevant cases. Let K be a convex set in H”, 
defined by a convex cone €(K). Using the bilinear form by,1 to identify the ambi- 
ent space R”*! with its dual, the support planes of C(K) are time-like or light-like, 
and the boundary of C(K)* is made of space-like or light-like lines, and hence its 
intersection K* with dS” is not empty. The set K* is a convex set of de Sitter space, 
the dual of K. As the lines of C(K) are orthogonal to the support planes of C(K)* 
by construction, K* is space-like: it has only space-like or light-like support planes. 
Conversely, if K is a space-like convex set in dS”, then we can define in the same 
way its dual convex set K*, which is a convex set of HH”. 

If K is a convex set in Ell”, we obtain similarly a dual convex set K* which is 
in Ell”. Analogously, the dual of a space-like convex set K in AdS? is a space-like 
convex set K* in Ad$. In any case, we have (K*)* = K. 
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The duality is also a duality between points and hyperplanes. For x € M = 
b~1(1), x* is the intersection (if non-empty) of N = b~!(—1) with the hyperplane 
of R”*! orthogonal to x for b. 


This duality points/hyperplane can also be seen in an affine manner. Let us begin 
with the hyperbolic case. A hyperplane in H” meets the absolute along a topological 
sphere S' of dimension n — 2. Then the point dual to the hyperplane is the apex of the 
cone formed by the lines tangent to the absolute along S. This is the usual notion of 
polarity transformation with respect to a (proper) quadric, which is an affine notion, 
see Figure 6 and Figure 7. 


N 
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Figure 6. The dual point of a line in different affine models of the hyperbolic plane 


Figure 7. The hyperplane containing P meets the absolute along a hypersphere S. The point P* is 
the common point of all the lines tangent to the absolute at points of S. 
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We provide an argument, in the n = 2 case, to fix ideas. In the double cover, a 
plane of H? corresponds to the intersection of the pseudo-sphere 


a = {x € R? | bo1(x, x) = —1} 


with a time-like plane P of Mim, which meets the isotropic cone J(b2,1) of Min? 
along two light-like vectors v1, v2. The light-like planes tangent to the isotropic cone 
containing v1, v2 meet along a line directed by a space-like vector v. It is easy to see 
that v is orthogonal to v1 and v2, and hence to P. 

The same holds in anti-de Sitter space: the dual of a space-like plane is a point of 
AdS°, and vice versa, see Figure 8. 

The duality points/hyperplanes suffices to recover the dual of a convex set, see 
Figure 9. Also, it gives the following description of de Sitter space. 


Fact 2.17. De Sitter space dS” is the space of (unoriented) hyperplanes of 1A". 


Figure 9. The dual of a convex set with respect to the sphere is the envelope of the hyperplanes dual 
to its boundary points 
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Remark 2.18. In the elliptic plane Ell’, it is well known that the duality can also be 
expressed in a metric way: the plane x* is the set of points at distance > from x. This 
is readily seen because x* is the projective quotient of a linear plane P in IE, and 
x is the projective quotient of a vector orthogonal to P. The orthogonality for 53,9 
immediately gives that the projective distance is equal to $ due to (b1) in Section 2.2. 

A similar argument leads to the following in Ad$?: the plane x* dual to the point 
x is the set of points at distance > from x (see Figure 8). 

The same computation also occurs in de Sitter space, if one considers the duality 
in the following way. As before, let P be a hyperplane in HH” and let P* be its dual 
point in dS”. Actually P* is also dual to a time-like hyperplane of dS”, the one 
defined by the same affine hyperplane as P, and that we still denote by P. Then in 
d”, the distance between P* and P is oe see Figure 7. 


Comments and references 


¢ When the pseudo-metric is written under the form given by Lemma 2.10, it 
is usually called a Hilbert or Cayley—Klein metric. It can be defined on any 
convex sets, and not only on the ones bounded by quadrics as here, see e.g. 
[50] for more details. 


As Euclidean space is made only of parabolic lines, it is sometimes called 
parabolic space. In dimension 2, the projective distance can be defined di- 
rectly on a quadric: this is the Poincaré quadric geometry [52] and [2] (see 
also the Prologue of the present volume). In particular, this justifies the term 
“parabolic.” 


Remarkably, in dimension 3, the sphere S? has a group structure, namely SU(2), 
with the determinant as the quadratic form on the ambient space. Similarly, 
Ad 8? is SL(2, IR), with minus the determinant as the quadratic form on the 
ambient space. Note that the Lie algebra su(2) (resp. s{(2, IR)) together with 
its Killing form is naturally identified with E? (resp. Mim?). 


The following fact comes readily from the definition of duality: any line from 
x meets x* orthogonally. This is useful in practice, as the affine models are 
certainly not conformal (to the Euclidean metric), and computations of angles 
may be cumbersome, but orthogonality is easily seen, see Figure 7. See Fig- 
ure 10 for an application. In particular, the Klein model of hyperbolic space is 
not a conformal model, as a striking difference with the other famous Poincaré 
model, which will not be used in this survey. 
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Figure 10. A hyperbolic right-angled hexagon is an affine triangle, hence the corresponding mod- 
uli space has dimension 3. Right-angled hexagons are fundamental pieces to construct compact 
hyperbolic surfaces [66] and [65]. 


* The duality in S? induces also a correspondence between angles and length, 


that has a great importance as many statements have a straightforward dual 
analogue. The most basic ones are spherical trigonometric laws for triangles. 


This is also true for a general duality with respect to a symmetric non-de- 
generate bilinear form b. Actually this property is contained in the definition 
of angle. Let P;, P2 be two hyperplanes in MM, and let x € P; M Pz. The 
outward unit normals n; and nz to P; and P2 in T,.M define two points on a 
pseudo-sphere S of 7M, which is naturally isometric to a pseudo-Euclidean 
space. The (exterior dihedral) angle between P; and P2 is the pseudo-distance 
on S between those two points. This does not depend on the choice of the 
point x € Py M P2. But 7;,M is also identified with a hyperplane in the ambi- 
ent R”*!, the one orthogonal to x for b. In particular, n; and 1 are identified 
with points in a pseudo-sphere N, dual to P; and P2. The distance between 
those points of N is exactly the distance between the points in S, hence equal 
to the angle between P; and P. 
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The point is that in 7;M, n1 and nz may belong to two different pseudo- 
spheres, but it is possible to define a pseudo-distance between points belonging 
to two different pseudo-spheres defined by a same symmetric non-degenerate 
bilinear form. We will not use this in the present paper. For trigonometric laws 
for hyperbolic/anti-de Sitter triangles, see [65] and [23]. 


Let Isomg be the connected component of the identity of the isometry group. 
There are famous identifications between Isomp(H*) and PSL(2, R), and be- 
tween the absolute and R P!. In dimension 3, there is an identification between 
Isomo (HH?) and PSL(2, C). The last group also acts on the absolute, which is 
naturally identified with C P’. In contrast, 


Isomo(Ad$?) = PSL(2, R) x PSL(2, R), 


and the absolute is naturally identified with RP! xR P!. 


3 Degenerate cases 


3.1 Geometries 


A geometry is a pair (X,G) where X is a manifold and G is a Lie group acting 
transitively by diffeomorphisms on X. The model spaces we introduced in Section 2 
are geometries, with X¥ = IM and G = PO(p,q). Note that here the group G is a 
subgroup of the group of projective transformations that can be characterized in two 
ways: G leaves IM invariant in the projective space and G is the isometry group of 
IM. We will now be interested by degenerate model spaces. This means that they are 
defined by a degenerate bilinear form (or equivalently, their absolute is not a proper 
quadric in an affine chart). !5 

A degenerate model space gives a geometry (IM, G), with G the subgroup of 
the group of projective transformations that leaves IM invariant. Equivalently, G is a 
subgroup of the group of projective transformations that preserves a degenerate (0, 2) 
tensor on IM. But we will consider the geometries (IM, H), with H a proper subgroup 
of G. The choice of H will be justified by a duality argument in this section, and next 
justified on the one hand by the process of degeneration introduced in Section 4 and 
on the other hand by the definition of connections and volume forms in Section 5. 

Unlike the preceding section, we do not attempt to give a unified treatment of the 
degenerate geometries, but we focus on Euclidean and Minkowski spaces, together 
with their dual spaces: co-Euclidean space and co-Minkowski space. 


!3 In particular, the absolute will contain lines, called isotropic. 
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3.2 Euclidean space 


Euclidean space as a degenerate model geometry. Euclidean space E” may be 
considered as a projective quotient of a pseudo-sphere Euc” of Rt! defined by the 
following degenerate bilinear form: 


b(x, y) = Xn+1)n41, (7) 


1.e. 


Suc Six ER! [a2 


Indeed, Euc” /{+ Id} is the complement of a hyperplane in RP”, i.e. it is identi- 
fied with the affine space A” of dimension 1. Note that in this model, the absolute 
(the projective quotient of the isotropic cone of b) is the hyperplane at infinity. In 
particular, all the lines are parabolic, and hence the projective distance is zero, so we 
do not recover the Euclidean metric. We need more information to recover Euclidean 
geometry from this model. 

The group G of projective transformations leaving Euc” invariant is the group of 
transformations of the form 


(8) 


where A belongs to GL(v, IR) and A ¥ 0, quotiented by the subgroup of multiples of 
the identity, i.e. it is the group of transformations of the form 


Of course one can always choose a representative with A = 1. Under the identifica- 
tion of Guc” /{+ Id} with the affine space A” given by taking the chart {x,+1 = 1}, 
the action of G is identified with the action on A” x {1} of matrices of the form 


(9) 
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Hence (Guc" /{+ Id}, G) is the affine geometry (A”, GL(n, R) = IR”). The Eu- 
clidean geometry (IE”, Isom(IE”)) is (Guc” /{+ Id}, H), where H is the subgroup of 
G such that A in (8) belongs to O(7). 

An element of the form (8) also acts on IR” x {0}, and the action reduces to the 
action of A. As A € O(n), passing to the projective quotient, the hyperplane at 
infinity of A” is endowed with the elliptic geometry Elll”~!. Conversely, suppose 
that (A”, H) is a geometry, where H is a group of affine transformations, whose 
action on the hyperplane at infinity is the one of PO(7). Then necessarily the part A, 
in a representative of the form (8) for an element of H, must belong to O(7). In other 
words, the Euclidean geometry can be characterized as follows. 


Fact 3.1. Euclidean space is projective space minus an elliptic hyperplane. 


Duality of convex sets. Recall that a convex body of IE” is a compact convex set in 
IE” with non-empty interior.!* We will add the following assumption. 


Definition 3.2. A convex body is admissible if it contains the origin in its interior. 


Actually every convex body is admissible up to a translation. It is suitable to 
consider K in BE” x {1} C E”*!, in order to introduce the cone C(K) in E”*! 


over K: 
“w= f() 


Let C(K)* be its dual in E”*! for the scalar product bn+1,0: 


re KAZ 0b. 


C(K)* = {(y, yn41) € R"*? | Bn 410((y. Ynt1), (%, Xn41)) < 0, 


(10) 
for all (x, Xn41) € C(K)}. 


We will denote by K™* the intersection of C(K)* with {ynz41 = —1}. We identify 
{¥n+1 = —1} with E”, so that K* is a closed convex set of IE”. It is readily seen that 


K* = {y € R” | bn o(x, y) < 1, forall x € K}. (11) 


The expression (11) corresponds to the affine duality with respect to the unit 
sphere: a point rv, with v € 8”, on the boundary of K will correspond to a support 
plane of K* of direction orthogonal to v and at distance 4 from the origin. Compare 
Figure 11 and Figure 12. In particular, the dual B, of a ball centred at the origin with 
radius r is the ball B,/, centred at the origin. 


'4 For some authors, a convex body is only a compact convex set. 
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Figure 11. The Euclidean duality is the duality with respect to the unit sphere 


Figure 12. The duality between points and lines in the Euclidean plane expressed in terms of orthog- 
onality in a higher-dimensional Euclidean space 


Spherical, hyperbolic, and other projective geometries 345 


Lemma 3.3. The dual of an admissible convex body in Euclidean space is an admis- 
sible convex body in Euclidean space. 


Proof. If K is a convex body, by definition there exists s,¢ > 0 such that one has 
B; C K C B;. Hence by (4), By, C K* C By /s: K* is bounded and contains the 
origin in its interior. 


Support functions. Let H: IR” — R be such that C(X) is the epigraph of H, i.e. 
C(K) = {(x, Xn41) € R™*! | xng1 = H(x)}. (12) 


The function H is convex, positive outside the origin, and homogeneous of de- 
gree 1: H(Ax) = AA (x) for A > 0. Hence using (12) and (10), 


C(K)* ={0y, 444) = R | Bao x) = eens 
for all (X)Xn41)) Xa = H(x)} 


i.e. 
C(K)* = {(y, Yn+1) € R"*? | bno(y, x) < —ynt1H(x), for all x € R"} 
so K* is determined by H: 


K* = {y €R” | bn o(y, x) < A(x), forall x € R”}. (13) 


Definition 3.4. The function H such that C(K) is the epigraph of H is the support 
function of K*. 


The support function has also the following interpretation. Let v be a unit vector 
of E” x {0}. Then H(v) is the distance in E” x {—1} between {0} x {—1} and the 
support plane of K* directed by v (see Figure 12). Hence (13) expresses the fact that 
K™* is the envelope of its support planes. 


The following fact follows easily from the construction. 


Fact 3.5. The support function provides a bijection between the space of admissi- 
ble convex bodies of IR” and the space of positive convex 1-homogeneous functions 
on R". 
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3.3 Co-Euclidean space 


Definition. On R”*', let b* be the following degenerate bilinear form 
B™(x,¥) = x11 H+ Xan. 
Let co€uc" be the unit sphere for b*: 
coGuc" = {x € R"*! | b*(x, x) = 1}, 


endowed with the restriction of b* to its tangent space. Topologically, co&uc” is 
8” x R. 


Definition 3.6. The space *IE” = co&uc" /{+ Id} is co-Euclidean space. 


The isotropic cone J(b*) is the line € = {(0,...,0,A)}. Let € = Pé. Then 
*Ie” = RP” \{¢}. Lines of *IE” are parabolic if they contain @, and elliptic otherwise. 
Duality. Co-Euclidean space is dual to the Euclidean space in the sense that it can 
be described as the set of hyperplanes of E”. Let P be an affine hyperplane in E”, and 
v its unit normal vector, pointing towards the side of P which does not contain the 
origin 0 of E”. Let h(v) be the distance from 0 to P, i.e. P has equation by o(-, v) — 
h(v) = 0. The vector ( h wy) € R”*! is orthogonal in E”*! to the linear hyperplane 
containing P x {—1}. Its projective quotient defines a point P* in *IE”, see Figure 12. 
One could also consider the other unit normal vector —v of P. The (signed) distance 
from the origin is then —/A(v), and the point —( h (w) has the same projective quotient 


as (nw): 


Fact 3.7. Co-Euclidean space *1:”" is the space of (unoriented) hyperplanes of I”. 


Conversely, an elliptic hyperplane P of *IE” (i.e. a hyperplane which does not 
contain £) is dual to a point P* of IE”. The point P* is the intersection of all the 
hyperplanes x*, for x € P. A co-Euclidean hyperplane of *IE” (i.e. a hyperplane 
containing £) is dual to a point at infinity. 

Let K be an admissible convex body of IE”, and let H be its support function, 
i.e. C(K)* is the epigraph of H. By abuse of notation, let us also denote by K* the 
intersection of C(K)* with co€uc”. The set K* is the epigraph of the restriction h of 
H to co€uc" N {xn41 = 0}, that we identify with 8’—!. Note that by homogeneity, 
H is determined by its restriction to 8"~!, which is actually h: 


H(x) = (xllh(—), 


Ill 


with ||x|| = /Bn.o@. x). 
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Fact 3.8. The convex set K* of co&uc" is the epigraph of h, the restriction to 8"! 
of the support function of K. 


Remark 3.9. The duality between hyperplanes of E” and points in *IE” leads to the 
following relation between angles and length. If P’ is another affine hyperplane of 
IE”, orthogonal to the unit vector v’, then the *IE” segment between P* and P”™ is 
elliptic (or equivalently, the restriction of b* is positive definite on any non-vertical 
hyperplane of R”*!). By the expression (b1) in Section 2.2, cosd(P*, P’*) = 
b*(P*, P’*) and it is readily seen that the last quantity is equal to by o(v, v’). Thus, 
the (exterior dihedral) angles between intersecting affine hyperplanes of IE” are equal 
to the distance between their duals in * IE”. 


A model geometry. Let Isom(b*) be the subgroup of projective transformations 
that preserve b*. There is a natural injective homomorphism 


*:Isom(IE”) —> Isom(b*) 
which is defined as follows. Let 
Pv, h) = {2 € R’ | dno, v) = hb} 


be an affine hyperplane of IE”, v € 8"~!,h € R*. For A € O(n) and? € R”, we 
have 
AP(v,h) +f = P(Av,h+bpo(v, AD), 


from which we define 


(14) 


Definition 3.10. The isometry group of co-Euclidean space, Isom(* IE”), is the group 
of projective transformations of the form 


(5) 


for A € O(n), f € R". 
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Note that Isom(* IE”) is a proper subgroup of the group of isometries of b*. For 
instance, the latter also contains homotheties of the form [diag(1,...,1,4)]. They 
correspond to displacement along parabolic lines, see Figure 13. 


Figure 13. In this affine model of the co-Euclidean plane, the absolute {¢} is at infinity, in such a 
way that the vertical lines are parabolic. Also the quotient of {1,—1} x R Cc 8! x R is at infinity. 
On the left is the effect of the dual of a Euclidean homothety on an elliptic line of the co-Euclidean 
plane, and on the right is the effect of the dual of a Euclidean translation on an elliptic line of the 
co-Euclidean plane. 


Co-Euclidean space is naturally endowed with a degenerate metric g*, which is 
the restriction on co €uc” of b*, pushed down to the projective quotient, and coincides 
with the elliptic metric on the elliptic hyperplanes, and is zero on the parabolic lines. 

The isotropy group /(p) of a point p € *EE” corresponds to the isometries of 
IE” that fix the dual hyperplane p”*, i.e. all the translations by vectors in the direction 
of p*. The group /(p) actually pointwise fixes the parabolic line passing through 
Pp, because each point on the line is dual to a hyperplane parallel to p* in E”. By 
a similar argument, /(p) preserves every other parabolic line (not through p) and 
acts on it by translation. Finally, the group /(p) acts simply transitively on all the 
elliptic hyperplanes containing p, because two such hyperplanes a, b are dual to two 
points a*, b* contained in the hyperplane p* of IE”, and there is a unique translation 
in the direction of p* sending a* to b*. The stabilizer in /(p) of any such elliptic 
hyperplane clearly acts on it as O(n — 1). 

As a result of this discussion, let us show that any (0, 2)-tensor on *E” invariant 
under the action of Isom(*IE”) must necessarily be degenerate. Let g be a bilinear 
form on 7,* IE” invariant under the action of /(p), and fix a vector X € T,*Ik” 
which is not tangent to the parabolic line through p. By the above discussion, there 
is an element of J(p) which maps X to X + V, where V is a vector tangent to the 
parabolic line. Hence if /(p) preserves the bilinear form g, then V is null for g. 
Hence g cannot be a scalar product. Since /(p) acts transitively on lines spanned by 
vectors X as above, all such vectors X must be of the same type, and therefore g is 
degenerate. 
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Fact 3.11. There is no pseudo-Riemannian metric on (* I" , Isom(*E”)). 


In fact, the argument above essentially shows that the metric must be of the form 
described above (up to a factor), namely when lifted to the double cover co€uc”, 
it restricts to the spherical metric on every hyperplane transverse to the parabolic 
line. 


3.4 Minkowski space 


Minkowski space as a degenerate model geometry. Like Euclidean space, Min- 
kowski space is a subgeometry of the affine geometry, and isometries of Minkowski 
space are of the form 


with A € O(n — 1,1). Such a transformation also acts on IR” x {0}, and the action 
reduces to the action of A. As A € O(n — 1, 1), passing to the projective quotient, 
the hyperplane at infinity of A” is endowed with the hyperbolic and the de Sitter 
geometries. Conversely, suppose that (A”, H) is a geometry, where H is a group 
of affine transformations, whose action on the hyperplane at infinity is the one of 
PO(n — 1, 1). Then necessarily the part A for the representative of an element of H 
must belong to O(m — 1, 1). This can be summarized as follows. 


Fact 3.12. Minkowski space is the projective space minus a hyperbolic-de Sitter hy- 
perplane. 


Duality of convex sets. An affine space-like hyperplane P splits Mim” into two 
half-spaces. The time-orientation of Mim” allows to speak about the future side of 
P. A convex set is future convex if it is the intersection of the future of space-like 
hyperplanes. Note that a future convex set may have also light-like support planes 
(e.g. the future cone of a point), !> but no time-like support plane. 


Fact 3.13. A future convex set contains the future cone of any of its points. 


'S The future cone of a point is the union of all the future directed time-like or light-like half lines 
from the point. 
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Let us denote by J the closure of the future cone of the origin minus the origin, 
Le. 


F := {x € R” | by-1,1(%, x) < 0, Xn > O}. (16) 


Definition 3.14. An admissible convex set of Min” is a future convex set contained 
in F. 


Up to translation, any future convex set contained in the future cone of a point is 
an admissible convex set. But not all future convex sets are admissible, even up to 
translation, for example consider the future of a single space-like hyperplane. 


Let K be an admissible convex set of Mim”, and let us identify Mim” with {—1} x 
Min” in Min”*?!. Then the cone C(K) in Min”*! over K is 


“w= 


Note that C(K) is not closed as it contains points with zero first coordinate in its 
closure. 


Let C(K)* be its dual in Min”*! for bn: 


ve Krzol, 


C(K)* = {(y1, 9) ER" | baa (yi. y), 1, x) < 0, for all (x1, x) € C(K)}. 
(17) 
We will denote by K* the intersection of C(K)* with {y; = —1}. As we identify 
{y, = —1} with Mim”, K* is aclosed convex set of Mim”, and it is readily seen that 


K* = {y € R” | bn-1,1(%, y) < -1, forall x € K}. (18) 


This corresponds to the affine duality with respect to the unit hyperboloid, compare 
Figure 14 and Figure 15. The dual of H,, the future convex side of a branch of 
hyperboloid with radius r, is H,/,. The fact that Hy = H, comes also from the 
following: the boundary of C( 1) is (a part of) the isotropic cone J(bn,1), which is 
dual to itself for by 1. 

Let us call admissible cone the future cone of a point contained in the interior of F, 


and admissible truncation the intersection of F with the future side of a space-like 
hyperplane P such that the origin is in the past of P, see Figure 16. 
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Figure 14. The duality of admissible convex sets in Minkowski space is the affine duality with respect 
to the upper hyperboloid 


Figure 15. The duality between points and lines in the Minkowski plane expressed in term of or- 
thogonality in a higher-dimensional Minkowski space 
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Figure 16. The dual of an admissible truncation is an admissible cone 


Lemma 3.15. The dual of an admissible truncation is an admissible cone, and vice 
versa. 


Proof. Let K be an admissible truncation, i.e. if v is the unit future vector orthogonal 
to P and if r is the distance from the origin to P, then 


K= {x € Min” | Dep il esd) <0, by—1,1(% —Trv, v) < O}. 


From (18), we see that ty € K*. From Fact 3.13, if C is the future cone of ty, then 
CC kK*. 
On the other hand, as ty e C,if y € C%, then by (18), Dp_1i(y,v) < —r, 


which implies that y € K, hence C* C K. The result follows from (6) and (4). 


Lemma 3.16. The dual of an admissible convex set of Minkowski space is an admis- 
sible convex set of Minkowski space. 


Proof. Let K be an admissible convex set. By Fact 3.13, K contains the future cone 
of a point, and by (4) and Lemma 3.15, K* is contained in an admissible truncation, 
in particular K* is in F. 

As K is in J, there exists a hyperplane separating K from the origin, in particular 
K is contained in an admissible truncation. By (4) and Lemma 3.15, K* contains the 
future cone of a point. From this, it is easy to see that the closed convex set K* must 
be a future convex set. 


Support function. Let H:IR” + RU {+00} be the convex function whose graph 
is the cone over the boundary of K. In particular, C(K) is the intersection of the 
epigraph of H with {x; > 0}: 


C(K) = {(x1, x) € Reo x R” | x1 > A(x)}. (19) 
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The function A is non-positive, convex and homogeneous of degree one. Let dom H 
be the domain of H, the set of points where HT takes finite values. By Fact 3.13, it 
is easy to see that any future time-like ray from the origin meets the boundary of K 
exactly once, hence 

int CdomA CF, 


(here F is the set (16) for the Minkowski structure induced on {0} x R” by that of 
the ambient Min”*'), Let v € OF, ie. v is a future light-like vector. If there exists 
aA > Osuch that Av € K, then v € dom H. Otherwise, as from (19), 


K ={xeF| A(x) <—l}, (20) 


we would have H(Av) > —1. So, by homogeneity, 0 > H(v) > —1/A for all A > 0, 
hence H (v) = 0, and the domain of H is F. 

As the epigraph of H is closed, H is lower semi-continuous, hence it is deter- 
mined by its restriction to int F. See Section 7 in [54] for details. We will denote by 
H the restriction of H to int ¥. The function H is negative. 


Definition 3.17. The function H is the support function of K*. 
The exact relation between H and K™ is given by the following lemma. 
Lemma 3.18. Let H be the support function of K*. Then 
K* = {y € R” | bn-1,1(y, x) < H(x), for all x € int F}. (21) 
Proof. By (18) and (20) 
K* = {y € BR” | bp-1,1(x, y) < —1 forall x € F, A(x) < -1}, 
So it is straightforward that the set 
A={y €R” | bn-11@, y) H(x) for all x € F} 


is contained in K*. Conversely, suppose that y € K* and let x € F. If H(x) = 0, 
then y € A. Otherwise, by homogeneity, 


SO 


Le. bn—11(x, y) < H (x), so y € A. By lower-semi continuity, the right-hand side 
of (21) is equal to A. 
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The support function also has the following interpretation. Let v be a unit vector 
of {0} x int ¥. Then H(v) is the distance in {—1} x Mim” between the origin and the 
space-like support plane of K* directed by v (see Figure 15).!° Hence (21) expresses 
the fact that K* is the envelope of its space-like support planes. 

The following fact follows easily from the construction, 


Fact 3.19. The support function provides a bijection between the space of admissible 
convex subsets of Mim” and the space of negative convex 1-homogenous functions 
on the cone int F. 


3.5 Co-Minkowski space 


Definition. On R”*!, let b* be the following degenerate bilinear form 


b* (x, y) = X2yo +++ + XnYn — Xn+1Yn4t1- 
Let coMin” be the unit sphere for b*: 
coMin”™ = {x | b*(x,x) = —1}, 


endowed with the restriction of b* on its tangent space. Topologically, coMin” is the 
real line times an open disc. Any section of coMin” by a hyperplane not containing 
a horizontal line is isometric to FH”. 


Definition 3.20. The space *Min” = coMin” /{+ Id} is co-Minkowski space. 


The isotropic cone J(b*) contains the line t= {(A,0,...,0)}. Let us call £2 = Pé 
the vertex of the absolute. Hyperplanes of co-Minkowski space which do not contain 
the vertex of the absolute are hyperbolic. Lines passing through the vertex of the 
absolute are parabolic. 


Fact 3.21. A strip between two parallel lines or the interior of a cone in the plane 


with the projective distance is a model of *Mimn? or *Min?. 


The two lines in the fact above meet at the vertex of the absolute. In an affine 
chart, in dimension 3, we have the following, see Figure 17. 


Fact 3.22. The convex side of a ruled quadric in R? (i.e. elliptic cone or elliptic 
cylinder or hyperbolic cylinder) with the projective distance is a model of *Min? or 
*Min?. 

‘© In other terms, the Lorentzian orthogonal projection of the origin onto the support plane is the 
point —H(v)v. 
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Figure 17. Projectively equivalent affine models of 3d co-Minkowski space. In each case, a hyper- 
bolic plane and a parabolic line are drawn. 


Duality. Let P be an affine space-like hyperplane in Mim”, and v its future unit 
normal vector. Let h(v) be the Lorentzian distance from 0 to P, i.e. the future of 
P has equation by—-1,1(¢, v) < —h(v). The vector lew € R"*! is orthogonal in 
Min”*! to the linear hyperplane containing P x {—1}. Its projective quotient defines 
a point P* in *IMim”, see Figure 15. One could also consider the past unit normal 
vector —v of P. The (signed) Lorentzian distance from the origin is then —h(v), and 
the point = my) has the same projective quotient as ae), 

Fact 3.23. Co-Minkowski space * Mim" is the space of (unoriented) space-like hy- 
perplanes of Min”. 


Conversely, a hyperbolic hyperplane of * Min” (i.e. a hyperplane of * Min” which 
does not contain £) is dual to a point of Mim”. A co-Minkowski hyperplane of * Min” 
(i.e. a hyperplane of *Mim” which contains £) is dual to a point at infinity. 

Let K be an admissible convex subset of Min”, and let H be its support function. 
By abuse of notation, let us also denote by K* the intersection of C(K)* with coMin”. 
The convex set K* is the closure of the epigraph of the restriction h of H to coMin" M 
{x ; =0}, which we identify with the upper part of the hyperboloid: 


i Sa Tite SO): 

Note that by homogeneity, H is determined by its restriction h to Je 
A(x) = ||xll-AQ@/Ilxll-), 

with |[x||— = Y/Dn—1,1 (x, x). 


Fact 3.24. The convex set K* of coMin” is the epigraph of h, the restriction to vie 
of the support function of K. 
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Note that the function h: H"~! — R can be extended by symmetry to H"~!, in 
such a way that the projective quotient is well defined on H”~!. 


Cylinder model. There is another convenient way to describe the dual of admissi- 
ble convex sets of Minkowski space in co-Minkowski space, by looking at the affine 
chart {x,41 = 1}. In this model, * Min” is an elliptic cylinder B”~! x R, with B”~! 
the open unit disc. 

Let H be the support function of K*. The homogeneous function H is also 
uniquely determined by its restriction h to B”~! x {1}. The intersection of C(K) with 
{Xn+1 = 1} is the graph of h. As a restriction of a convex function to a hyperplane, 
h is a convex function. Conversely, it can be easily seen that the 1-homogeneous 
extension of a convex function B”~! x {1} is a convex function on the future cone of 
the origin. See e.g. [14, Lemma 2.6]. 


Fact 3.25. There is a bijection between negative convex functions B"~! —> R and 
admissible convex subsets of Minkowski space Min”. 


Let us denote by K* the intersection of C(K)* with {x,4,; = 1}. By (21), the 
convex function h determines K*: 


«=f lmnu( (3) (0)) =H} 


The way to go from K* to K® is by a projective transformation sending the origin to 
infinity, see Figure 18. 


Figure 18. The projective transformation C) > (2/ if gives K * from K*. A hyperbola is sent to 
a parabola. 
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A model geometry. Let Isom(b*) be the subgroup of projective transformations 
that preserves b*. There is a natural injective homomorphism *:Isom(Min”) —> 
Isom(b* ) which is defined as follows. Let 


P(v,h) = {z € R" | ba-ia(z, v) =f} 


be a space-like affine hyperplane of Mim”, v € H%~',h € R*. For A € O(n — 1, 1) 
and f € R”, we have 


AP(v,h) +i = P(Av,h+ bp_11(v, A‘), 


from which we define */ for J € Isom(IMim”) as in (14), with A € O(n — 1, 1). 


Definition 3.26. The isometry group of co-Minkowski space, Isom(* Mim”), is the 
group of projective transformations of the form (15) for A € O(n — 1,1), ¢ € R”. 


Note that Isom(* Mim”) is a proper subgroup of the group of isometries of b*. For 
instance the latter also contains homotheties of the form [diag(1,..., 1, A)]. As in the 
co-Euclidean case, these homotheties correspond to displacement along parabolic 
lines, see Figure 19. 

Co-Minkowski space is naturally endowed with a degenerate metric g*, which is 
the hyperbolic metric on the hyperbolic hyperplanes, and zero on the parabolic lines. 


Figure 19. In this affine model of the co-Minkowski plane, the absolute is composed of two vertical 
lines. The other vertical lines are parabolic. On the left is the effect of a Minkowski homothety on a 
hyperbolic line of the co-Minkowski plane, and on the right is the effect of a Minkowski translation 
on a hyperbolic line of the co-Minkowski plane. 


Following the same reasoning as in the co-Euclidean case, we see that any 
(0, 2)-tensor field on the co-Minkowski space which is invariant under the action 
of Isom(* Mim”) must have parabolic directions in its kernel. 


Fact 3.27. There is no pseudo-Riemannian metric on (* Min", lsom(* Min”)). 


358 


F Fillastre and A. Seppi 


Comments and references 


One fundamental property of the support functions is that they behave well un- 
der addition. More precisely, the Minkowski sum A+ B = {a+b|A€EA, 
b € B} of two admissible convex bodies is an admissible convex body. 
If H4 and Hg are the corresponding support functions, then one has H4+z3 = 
H4 + Hg. Classical references for convex bodies are for example [14] and the 
up-to-date [59]. 


From Remark 3.9 and (2), the angle between two lines in the Euclidean plane 
can be written as the logarithm of a cross-ratio. This is the Laguerre formula, 
see e.g. [53]. 


Similarly to Fact 3.1 and 3.12, on has that (IR”, bp—2,2) can be identified with 
the projective space of dimension n minus a hyperplane endowed with the anti- 
de Sitter geometry. 


The duality between Euclidean (resp. Minkowski) and co-Euclidean (resp. co- 
Minkowski) spaces is expressed using a different formalism in [24] and [64]. 


Similarly to Fact 3.23, it is easy to see that the outside of co-Minkowski space 
(i.e. the projective quotient of (b* )~!(1)) is the space of time-like hyperplanes 
of Minkowski space. 


The boundary of an admissible convex set K in Minkowski space is the graph 
of a 1-Lipschitz convex function u: R’~! —> R. From (3.5), 


sap ou((.(2) 
(p,pn)EeK | Pn 


h= sup {bn—1,0(x, p) — pn} 
(p.pn EK 


1.e. 


1.e. 


h= sup {bn-1,0(x, p) —u(p)}. 
peR4Z 


i.e. u is the dual of h for the Legendre—Fenchel duality, see e.g. [14, 2.3] and 
references therein for more details. 


By Fact 3.25, a negative convex function on the open unit ball B is the support 
function h of an admissible convex set K in Minkowski space. Let us suppose 
that the lower semi-continuous extension g of h to the boundary of B is con- 
tinuous (this is not always the case, for example g could attain no maximum, 
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see p.870 of [33]). Let ho be the convex function on B such that the lower 
boundary of the convex hull of the graph of g is the graph of ho. Then ho = h. 
The function /g is the support function of a convex subset of Minkowski space 
which is the Cauchy development of the convex set whose support function is h. 
The function g encodes the light-like support planes of K (and 0B x R, the ab- 
solute of co-Minkowski space in a suitable affine model, is sometimes called 
the Penrose boundary). Note that a light-like support plane may not touch the 
boundary of K, as in the case of the upper branch of the hyperboloid. 


Let ho be the concave function on B such that the upper boundary of the 
convex hull of the graph of g is the graph of hg . It is possible to consider hg 
as the support function of a past convex set of Minkowski space. At the end 
of the day, at a projective level, it would be better to consider as “convex sets” 
of Minkowski space the data of a future convex set and a past convex set, with 
disjoint interior and same light-like support planes. 


The 3 dimensional spaces H?, Ad$? and *IMim? can be defined in a unified 
way as spaces of matrices with coefficient in R + «R, withk ¢ R, x? € R, 
see [30]. 


The reference [30] also contains a Gauss—Bonnet formula for the *Min? ge- 
ometry. 


4 Geometric transition 


In this section, we will study the so-called geometric transition of model spaces as 
subsets of projective space. Recall that a model space is an open subset M of RP”, 
endowed with a closed subgroup Isom(IM) of PGL(7 + 1, IR) which preserves the 
geometric structure of IM. 

Moreover, recall that given a model space M, by applying a projective transfor- 
mation g € PGL(n + 1, R) one obtains another model space which is equivalent to 
IM. Indeed, the group of isometries of gM is precisely g Isom(M)g~!. 

We say that a model space (IN, Isom(IN)) is a conjugacy limit or rescaled limit of 
another model space (IM, Isom(IM)) if there exists a sequence of projective transfor- 
mations gx € PGL(n + 1, R) such that 


i) the sequence g,M converges to IN as k + ov; 


ii) the sequence of closed subgroups gz Isom(IM) i converges to Isom(IN). 
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The convergence here should be meant as the Hausdorff convergence, for in- 
stance. Thus the conditions i) and ii) essentially mean that 


i’) every x € N is the limit of a sequence {g,x;,}, for some xz, € M, as k — co; 


ii’) every h € Isom(IN) is the limit of a sequence {g,hx a. for some hy € 
Isom(IM), as k — oo. 


In general, the sequences gz and hz are not compact sequences in PGL(n + 1, R). 

A toy model of a geometric transition is the 1-dimensional case, namely, the de- 
generation of projective lines. We already know that a line c in a model space is a 
subset of a copy of R P', with 0, 1 or 2 points in the absolute if c is elliptic, parabolic 
or hyperbolic respectively. More precisely, composing with a projective transfor- 
mation, we can assume that c is RP! (if elliptic), RP!\{[1 : 0]} (parabolic) or 
RP! \{[1: 1], [1 : —1]} (hyperbolic). The stabilizer of c in the isometry group of the 
model space is identified with a subgroup of PGL(2, R) of the form 


Re = |°°S 0 —sind T= la s = coshg —sinhg 
§~J| sind cosé |f’ eo Oe a? ~~ |sinhg coshe }f’ 
for c elliptic, parabolic of hyperbolic respectively. By applying the projective trans- 


formations 
_{[k 0 
8k = 0. 1\° 


one sees that given a sequence 6;, the sequence of conjugates 


=o cos 0, —k sin 0, 
Bk Ro 8e = fie sin®z cos Ox 


converges to a projective transformation of the form Tz provided 6, ~ a/k. In an 
analogous way, gx So, ti converges to T, ask — oo if gj ~ §, and g; maps 
the two points at infinity [1 : 1] and [1 : —1] of the hyperbolic line to [1 : tl and 
[1 : —t), which converge to [1 : 0] as k — oo. Hence: 


Fact 4.1. Elliptic and hyperbolic lines converge (in the sense of geometric transition) 
to a parabolic line. 


See also Figure 20. We will meet this phenomenon again in higher dimensions. 
We start by discussing some examples of geometric transition of 2-dimensional model 
spaces in RP3, 
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Figure 20. Geometric transition of elliptic line and hyperbolic line to Euclidean line, represented in 
the double cover 


4.1 Limits of 2-dimensional model spaces 


Recall that, in dimension 2, we have already introduced the model spaces correspond- 
ing to the elliptic plane Ell’, the hyperbolic plane H?, the de Sitter plane d$? (which 
is anti-isometric to Ad$*) and the degenerate model spaces of the Euclidean and 
Minkowski planes IE? and Mim?, and the co-Euclidean and co-Minkowski planes 
*E? and * Min’. 


Elliptic and hyperbolic planes limit to Euclidean plane. Let us consider the unit 
sphere S? in the Euclidean space 3. We shall construct a geometric transition which 
has the Euclidean plane as a conjugacy limit. For this purpose, consider the following 
projective transformations g; € GL(3, R), for t € (0, 1]: 


1 
- 0 0 
t 

Oa Ge 3G 

t 
0 0 1 


The transformations g, map the unit sphere S? to an ellipsoid. 
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Observe that g; fixes the point (0, 0, 1) € E? and “stretches” the directions x1, x2. 
More precisely, given a differentiable path x(t) € S?, for € [0, 1], such that x(0) = 
(0, 0, 1), one has 


X1(t) 

t x1 (0) 
lim g,x; = lim | X2(t) | = | %(0) 
t>0 t>0 | —— 

t 1 

x3(t) 


Roughly speaking, the rescaled limit is a point of the affine chart {x3 = 1}, which 
encodes the first order derivative of the x1, x2 coordinates. Heuristically, this rescal- 
ing procedure is a blow-up of a point, in the sense that the point (0, 0, 1) is preserved, 
and the transverse directions are “blown-up.” 

This procedure is actually well defined in projective space. Indeed the transfor- 
mations g; descend to projective transformations of R P?, which we will still denote 
by g;. The points of the Euclidean plane IK”, which is defined as 


E? = {x € E? | x2 = 1 /{+1d 
are conjugacy limits as t — 0 of sequences of points of the elliptic plane 
Ml? = {x € E? | b3,9(x, x) = 13/{+ Id}, 


thus satisfying condition 1) in the definition of conjugacy limit (see p. 359), for in- 
stance with t = 1/k, see Figure 21. To check that the Euclidean plane is a conjugacy 
limit of the elliptic space, one has to check also condition ii) on the isometry groups. 
That is, given an isometry of Ell”, namely an element in PO(3), let us choose a 
representative with determinant +1 of the form 


411 412 413 
h=|a12 a22 a3 
413 423 33 


By a direct computation, one checks that 


13 
aii 4120 — 


hei = 423 |. 
"8 [ a22 “| 
ta13 tA23 33 
: : : : 2 
In fact, g;hg;! is a transformation which preserves the quadratic form (4) x7 +f 


( ee + a, Thus every accumulation point of sequences of the form g;h(t) o is 
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necessarily of the form 


a11(0) ay2(0) 4 13(0) 
hoo = | a12(0)  a22(0) 23(0) | € Isom(E7), 


0 0 +1 
where 
411(0) 442(0) 
oe oO) =e) 
and 


ai1(0) ai2(0) O 
h(0) = | a12(0) a22(0) 0 
0 0 +1 


This shows that the conjugacy limit of the group of isometries of Ell? is precisely the 
group of isometries of IE”, embedded in PGL(3, R) in the usual way (see (8) and (9)). 
By a completely analogous proof, using the same transformations g;, one sees that 
IE” is a conjugacy limit of the hyperbolic plane H?, see Figure 21. Thus one can 
imagine that the Euclidean plane (at tf = 0) is an interpolation of the elliptic plane 
(for ¢ > 0) and the hyperbolic plane (for t < 0). 


Elliptic and de Sitter plane limit to co-Euclidean plane. We now describe a dif- 
ferent procedure which permits to obtain a different limit from the elliptic plane, 
namely, we will obtain the co-Euclidean plane as a conjugacy limit of the elliptic 
plane. Thus, consider 


- © 


gra € PGL(3, R) 


co oOo 


j=) 
~[—S © 


for t € (0,1]. As a remarkable difference with the previous case, the projective 
transformation g? leaves invariant a geodesic line of Ell, namely the line which is 
defined by the plane {x3 = 0} of E%. So the “stretching” occurs only in the transverse 
directions to {x3 = 0}, and indeed the rescaled limit of a differentiable path of points 
of the form x(t) = [x1(¢) : x2(t) : x3(t)] such that x3(0) = 0 is 


ba x1 (0) 

lim gx; = lim Mae) | x2(0) 

t—>0 t>0 | 22 | <3(0) 
t 
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We will indeed call this transition the blow-up of a line. This shows that points of the 
co-Euclidean plane, defined by 


*E? = {x € E? | b*(x, x) = 1}/{+Id} 
(where b* (x, x) = x7 + x3) are rescaled limits of sequences in 
Ell? = {x € E? | b3,9(x, x) = 1}/{+ Id}. 


For what concerns the isometry groups, we will give again a computation which 
shows that Isom(*IE?) is the limit of g* Isom(EIl*)(g*)~!. As before, choose a 
representative of determinant +1, say 


411 412 443 
h=|a12 d22 d23)] € PGL(3, R). 
413 423 433 


By a direct computation, 


a1 412 isi ai1(0) ai2(0) O 
gth(gi) |) = is he lng ee A 
. 33 pig). Gaza) =I 


provided a13(0) = a23(0) = 0 and 


oO a12(0) 
a12(0) d22(0) 


In conclusion the limit of g*h(g*)~! is an element of the group Isom(*IE”) in Defi- 
nition 3.10. 
Recall that we have introduced a duality of Ell? to itself, which maps lines of 


) € O(2). 


Ell? to points of Ell? and vice versa. In principle, the dual space is a model space in 
the dual projective space. The dual space of Elll* is thus identified to Ell’ itself if one 
chooses the scalar product of E? to identify IR P? to its dual projective space. Now, 
observe that the transformations g; and g*, which have been used to rescale Ell” to 
get IE? and *IE? respectively, are well-behaved with respect to this duality. Namely, 
b3.0(x, y) = Oif and only if b3.9(g:x, gy) = 0. In other words, if * denotes the 
duality point-line induced by the ambient scalar product, the following diagram is 
commutative: 


(Ell?, Isom(Ell?)) —**> (g, Ell”, g; Isom(EIl?)g7!) 


(ElI?, Isom(Ell2)) “> (g*EIl?, g* Isom(Ell?)(g*)~!) 
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When t — 0, we have already observed that (g;EIl’, g; Isom(Ell’) g, ') con- 
verges to (IE?, Isom(IE7)), while on the other hand (g* Ell’, g* Isom(EIl?)(g*)~!) 
converges to (*IE?,* Isom(*E”)). The natural duality of IE* and * IE? is again induced 
by the scalar product of IE? and the commutativity of the diagram passes to the limit. 
Thus, we have shown: 


Fact 4.2. The dual in *E of a rescaled limit X59. € E? of points x(t) € EIl? is the 
rescaled limit of the dual lines x(t)* in Ell? and vice versa (by exchanging the roles 
of points and lines). 


In a completely analogous way, one can define a geometric transition which per- 
mits to obtain the co-Euclidean plane * IE? as a conjugacy limit of the de Sitter plane 
dS?. Indeed, observe that Ell? and d$? are the constant curvature non-degenerate 
model spaces in dimension 2 which contain an elliptic line Ell’, whereas *E? is the 
degenerate space having an embedded copy of (Elll', Isom(EIl')). Therefore the 
following fact holds: 


Fact 4.3. The dual in *IK? of a rescaled limit Xo. € E? of points x(t) € H? is 
the rescaled limit of the dual lines x(t)* in d$*. The dual in E? of a rescaled limit 
Xoo € *EE* of points x(t) € dS? is the rescaled limit of the dual lines x(t)* in H?. 
The converse is also true, by exchanging the role of points and lines. 


We thus have the following diagram, which encodes the possible transitions and 
dualities involving E? and *IE?, see Figures 21 and 22: 


blow-up point blow-up point 
———_ —__— 


(Ell, Isom(Ell7)) (2, Isom(E?)) (2, Isom(H?)) 


(EN, Isom(EII2)) 22S"? S (12, Tsom(*E2)) SEE (a $2, Isom(d82)) 
(22) 


Geometric transitions with limits Minkowski and co-Minkowski planes. Given 
the above constructions, it is immediate to see that one can mimic the blow-up of a 
point for the de Sitter plane 


dS? = {x € Mim? | by (x, x) = 1}/{+ Id}. 
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Figure 21. Degenerations of the top line in Scheme (22), inside the ambient space, in the double 
cover 


| HT 


Figure 22. Degenerations of the bottom line in Scheme (22), inside the ambient space, in the double 
cover 
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Using again the transformations g;, the limit will be again represented by an affine 
chart defined by {x3 = 1}/{- Id}, and the conjugacy limit of Isom(d$?) & PO(2, 1) 
will be 


Isom(IMin?) = a2, a22 bo 


0 0 4+] a12 422 


eae fem 9) coun 


Hence we can say that the blow-up of a point permits to obtain (Min?, Isom(IMin*)) 
as a limit of (d$?, Isom(d$7)). 

Also in this case we shall obtain the dual transition. As in the case of the 
Euclidean/co-Euclidean plane, there are two possible transitions having limit the co- 
Minkowski plane. Indeed, both the hyperbolic plane and the de Sitter plane contain 
hyperbolic lines (space-like in H?, time-like in d$*). For the hyperbolic plane, one 
checks directly that 


g;Isom(?)(g7)" 


ai1 412 a a 
—> Isom(* Min?) = {| a2, ax 0 ie m2) £001) 
| 12 422 


Observe that, in terms of geometric transition, if (IN, Isom(IN)) is a conjugacy 
limit of (IM, Isom(M)), then also the anti-isometric space (IN, Isom(IN)) is a limit of 
(IM, Isom(IM)) (and also of (IM, Isom(IM)), of course), by conjugating for the same 
projective transformations. For instance, both d$? and dS? limit to *IMin?. How- 
ever, we prefer to say that (dS2, Isom(d$2)) limits to (*Mimn?, Isom(*Min7)) by 
blowing-up a space-like line, since with this choice, space-like lines (of hyperbolic 
type) converge to space-like lines of *Min?. 


For the same reason, blowing-up a point one gets that dS? limits to Min”. How- 


ever, in this special case, the space Min?, anti-isometric to Min?, is also isometric 
to Mim, thus we have a geometric transition 


(dS2, Isom(d$2)) —> (IMin?, Isom(Mim7)). 


This is formally not the same as the transition of dS* to Mim? (in d$? we have 
space-like elliptic lines which converge to space-like lines of Min*, while in dg2 
the space-like lines are hyperbolic and converge to space-like lines), although it is 
obtained for instance by applying the usual transformations g;. Recall also that the 
dual of dS$2, considered as the space of space-like lines, is dS? itself (see Figure 7). 
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By the same argument as in the previous paragraph (using the ambient metric of 


Min? on the left, and its anti-isometric Min? on the right), one obtains the following 
diagram which shows the behavior of transitions and dualities, see Figure 23: 


blow-up point blow- pues point a5 


(d$?, Isom(d$2)) ——> (Mim, Isom(IMim?)) << (4, Isom(d$2)) 


2 2, blow- pedi a line : blow- overt line > aax 
(H?, Isom(H?)) ———> (*Mim?, Isom(*Min?)) <—— (dS2, Isom(d$2)) 


(23) 


Figure 23. Degenerations of the bottom line in Scheme (23), in an affine chart (recall that dS? is the 
same as Ad$7) 
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In words: 


Fact 4.4. The dual in *Min? of a rescaled limit X59 € Mim? of points x(t) € dS? 
is the rescaled limit of the dual lines x(t)* in H?. The dual in Min? of a rescaled 
limit Xoo € *Mim? of points x(t) € H? is the rescaled limit of the dual lines x(t)* 
in d$*. The converse is also true, by exchanging the role of points and lines. 


Fact 4.5. The dual in * Min? of a rescaled limit Xoo € Mim? of points x(t) € ds? 
is the rescaled limit of the dual lines x(t)* in dS? and vice versa (by exchanging the 
roles of points and lines). 


4.2 Limits of 3-dimensional model spaces 


At this stage, the reader will not be surprised to find that the transition procedures 
described in the previous paragraph extend also to the three-dimensional case (and to 
higher dimensions, although this will not be considered in this survey). For instance, 
the following diagram summarizes the transitions which have limits in the Euclidean 
space or the co-Euclidean space: 


blow-up point blow-up point 
a —_—_——_ 


(Ell?, Isom(EIl)) (IE, Isom(IE3)) (3, Isom(H?)) 


(EI, Isom(EII3)) "PPS" (* 13, Tsom(*E3)) “PP” (4 $3, Isom(d$3)) 
(24) 

Indeed, the way to rescale elliptic space or de Sitter space to get a limit in co- 
Euclidean space is by blowing-up a plane. Space-like planes in Ell? and d$? are 
indeed copies of Ell”, and *E is the degenerate geometry having an embedded 
Ell* plane. It is thus an exercise to rewrite the statements of Facts 4.2 and 4.3 in 
the three-dimensional setting, by making use of the duality points/planes. 

The other diagram we considered in dimension 2 also has a generalization here. 
In fact, it will now become clear that the right model spaces which have a limit 
in Minkowski space (by blowing-up a point) are de Sitter and anti-de Sitter space; 
whereas their duals (hyperbolic space and anti-de Sitter space itself) have a limit in 
co-Minkowski space, see Figure 24: 


blow-up point blow-up point 


(dS3, Isom(d$?)) —— > (Mimn?, Isom(IMin?)) <—— (AdS3, Isom(Ad$?)) 


|: [ E 


(3, Isom(H3)) eS" Min, Isom(*IMim3)) ee "fA d $3, Isom(Ad$?)) 
(25) 
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Figure 24. Degenerations of the bottom line in Scheme (25), in an affine chart 


In fact, in dimension 2 we did not define an anti-de Sitter plane, as it would be 
anti-isometric to the de Sitter plane. Thus we write here the statement which encodes 
the relationship between transitions and dualities for anti-de Sitter space, leaving the 
analogous statements to the reader: 


Fact 4.6. The dual in *Min? of a rescaled limit x9. € Mim? of points x(t) € AdS$? 
is the rescaled limit of the dual planes x(t)* in AdS? and vice versa (by exchanging 
the roles of points and planes). 


Blowing-up time-like planes. However, we observe that in dimension three there 
are more complicated pheonomena which might occur. In particular, one might be 
interested in the blow-up of a time-like plane in a Lorentzian model-space. Of course 
this is the same of blowing-up a space-like plane in the anti-isometric model space. 
Intrinsically, the time-like plane can be a copy of dS? or dS?. Let us analyze the 


Spherical, hyperbolic, and other projective geometries 371 


following diagram: 


blow-up point 7,3 blow-up point ———> 


(d$3, Isom(d$3)) ——> (Min? Isom(Min? )) ———_ (Ad$3, Isom(AdS3)) 


' : i 


(d$3, Isom(d$3) ee" Min3, Isom(* Min?) ee "(A dS3, Isom(AdS2)) 


On the upper line, there is nothing surprising. We have just re-written the usual 
blow-up of a point, but by considering the anti-isometric copies of dS, Mim? and 
AdS$3. In fact, the definition of geometric transition does not distinguish between a 
model space and its anti-isometric copy. Anyway, we decided to stick to the conven- 
tion to choose the sign of the metric in such a way that space-like lines converge to 
space-like lines, and so on. 

On the lower line, we have already encountered the duality which appears on the 
left. In fact, in Min‘ the linear hyperplanes which define time-like planes in dS? also 
define planes in H? (see Remark 2.18). Hence the space of time-like planes in d$?, 
or equivalently the space of space-like planes in dS3, is naturally dS? itself. 

In the right-hand side, recall that A.d$? can be defined as 


AdS3 = {x € R?? | by 9(x) = 1}/{+ Id}. 


Thus a space-like plane in Ad? is a copy of d$?, and the dual space of AdS3, 
considered as the space of space-like planes of AdS3, is Ad$3 itself. 


It remains to understand what is the limit space in the centre of the lower line. 
We denoted it by *IMim? to indicate that it is the dual of Mim? (and not the space 
* Mim? anti-isometric to * Min? !) We will omit the details of the definition and the 
proof of the commutativity of the last diagram presented. However, observe that in 
the lower line, both d$? and A.d$3 contain a totally geodesic copy of d$?. Thus one 
can define a transition procedure which blows-up a time-like plane, stretching the 
transverse directions. Topologically the limit space is expected to be dS? x R. By 
a construction similar to that of *IMim?, one can identify this space to the space of 


time-like planes in Mim? (or of space-like planes in Mim?). In an affine chart, this 
would be the exterior of the cylinder which represents *Min?. 


Comments and references 


* Let us denote by Gal? the Galilean plane, i.e. the projective plane minus a 
parabolic line. See [69]. Below are shown the possible degenerations of the 
three Riemannian and the three Lorentzian plane geometries. This diagram is 
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the one in Section 5.3 of [24], adapted to our terminology. 


Ell2 H2 d$2 
E2 *Mimn * 2 Min? 
Gal? 


This essentially shows that the geometric transitions considered in this paper 
are all the possible transitions in R P?, except the further space Gal? which is, 
in some vague sense, doubly-degenerate. Essentially, the isometry group for 
Gal? is the subset of triangular matrices which preserve an affine chart. 


¢ In [24], the possible degenerations are classified in every dimension. Already in 
dimension 3, apart from those considered above, there are other simple degen- 
erations, essentially obtained by blowing up a line. These spaces are described 
as model spaces (X, Isom(X)) and in some cases they contain as subgeome- 
tries (i.e. as a geometry (X,G) where G is a subgroup of Isom(X)) other 3- 
dimensional geometries in the sense of Thurston. For instance Sol geometry is 
a possible limit of hyperbolic structures [42]. One could draw the correspond- 
ing diagram in dimension 3, which would already be pretty complicated, and 
find several other spaces as double degenerations. For example, an affine space 
endowed with the action of a group of lower triangular matrices, with unitary 
elements on the diagonal, is a generalization of Gal’. 


¢ For the effect of the two-dimensional transition hyperbolic-Euclidean-spherical, 
the effects on angles, area etc. of triangles are studied in [3]. 


5 Connection and volume form 


In this section we discuss the definitions of the Levi-Civita connection and volume 
form for model spaces, starting by the general setting of Riemannian or pseudo- 
Riemannian manifolds and then specializing to the cases of constant curvature mani- 
folds. We will then give a construction of a geometric connection and a volume form 
on the degenerate cases of co-Euclidean and co-Minkowski space. 
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5.1 Non-degenerate model spaces 


Let (M, g) be a three-dimensional Riemannian or pseudo-Riemannian manifold. 
Many key examples have already been introduced: E?, H? and Ell? are Rieman- 
nian, Min?, dS? and Ad$3 are pseudo-Riemannian. Recall that the non-degenerate 
metric g uniquely determines the Levi-Civita connection, which we will denote by 
V® or V™ if there is no ambiguity about the metric tensor g. Indeed, the Levi-Civita 
connection is the unique connection which satisfies the following properties: 


° V8 is symmetric, i.e. Vy Y — V¥ X = [X,Y] for any pair of vector fields X, Y; 


¢ V® is compatible with the metric, i.e. Z.g(X, Y) = g(VEX, Y)+2(X, V5Y) 
for any vector fields X, Y, Z. 


The latter condition can also be expressed by saying that V?g = 0, namely the 
metric tensor is parallel with respect to V*. Recall also that a volume form is defined 
on (M, g), provided M is oriented. Namely, the volume form is a 3-form @ such 
that w(X, Y, Z) = 1 if X, Y, Z is an oriented (with respect to the orientation of M) 
orthonormal frame for g. The volume form is also characterized by being the unique 
3-form w such that 


* w is parallel with respect to V8, i.e. V8@ = 0; 


* w(V1, V2, v3) = 1, where v1, v2, v3 is an oriented orthonormal triple of vectors 
at a fixed point Xo. 


Indeed, as the parallel transport preserves an oriented orthonormal basis, the volume 
form is parallel for the connection, i.e. V?@ = 0, i.e. 


Z.(X1,...,Xn) = OVEN, ..., Xn) t+ $O(K1,..., VEXn). (26) 


Explicit construction of the ambient Levi-Civita connection and volume form. 
Let us take a step back, and see how the Levi-Civita connection of the three-dimen- 
sional manifolds considered here can be defined. For instance, the Levi-Civita con- 
nection of Euclidean space is simply given by differentiation of the standard coordi- 
nates of a vector field, that is, 


Ve w = Dwi(v), 


for any pair of smooth vector fields v,w. The connection of Minkowski space is 
defined analogously, and thus on the same affine space, the Levi-Civita connections 
of the Euclidean and the Minkowski metric coincide. Moreover, the standard volume 
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form of R? coincides with the volume forms (induced by the metric) of E3 and Min?, 
i.e. in the standard coordinates (x, y, Z): 


Op3 = Opin = AX Ady Adz= @p;. (27) 


Of course, the same definitions can be given for any n-dimensional vector space 
endowed with a non-degenerate symmetric bilinear form b. Now we will use this 
fact to define the Levi-Civita connection and the volume form of the model spaces. 
We have already observed that for all non degenerate model spaces M = M/{+ Id} 
which we have defined as the (projectivization of the) subset M of (IR*+, b) where the 
quadratic form defined by b takes the value | (or —1), the normal vector to M at a 
point x is precisely x itself. Thus, by the definition of the Levi-Civita connection of 
an embedded hypersurface in a higher-dimensional manifold, we obtain, for vector 
fields v, w tangent to M, 


Dw(v) = Vw — bv, w)x. 


We used that the identity and the second fundamental form coincide (up to a sign) 
with the first fundamental form (see also Section 6 below). Clearly this definition 
descends to the definition of the Levi-Civita connection of IM. 

Also the volume form of JM (and thus of IM) can be defined in terms of the ambient 
volume form of R*. Indeed, given any triple of vectors v, w, u in T,.M, x is also the 
normal vector to M, and thus one can define 


om (VU, W,U) = WOpa(x,v, w,U). 


5.2 Degenerate cases 


We will now introduce a natural connection and volume form on *E? and *Min?. 
We will discuss the meaning of naturality in the following, but of course we can 
anticipate that a natural connection/volume form will be preserved by the isometry 
group of co-Euclidean (resp. co-Minkowski) space, as introduced in Definitions 3.10 
and 3.26. 


The connection of co-Euclidean space. To define a connection on *E? 


by defining a connection on its double cover, namely 


, we Start 


cobuc® = {(x1, X2,x3, x4) | xf +23 +.x3 = I= {x € R*| b*(x, x) = I}, 


and thus we will consider $8” x R. as a model for co €uc?. In the classical case of model 
spaces defined by a non degenerate symmetric quadratic form b, the key ingredient 
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to define the Levi-Civita connection was the existence of the normal vector field N, 
so as to be able to write 


Dw(v) = Vw + b(v, w)N. 


Clearly the normal vector field has the property that it is preserved by the group of 
isometries of the ambient quadratic form b. In this degenerate case, the bilinear form 
b* is degenerate, and thus it does not enable us to determine a unit normal vector 
field. However, there is a well-defined transverse vector field to co€uc?, namely the 
vector field which at the point x € co€uc* C R* is defined by 


N, =x € R¥, 


Tautologically, this vector field N is preserved by the group Isom(*IE*), which means 
that if A € Isom(*E3), then Ay (Nx) = Nacx). Thus one can use the vector field N to 
decompose the ambient derivative of two vector fields in a tangential and a “normal” 
component. 


Definition 5.1 (co-Euclidean connection). Given two vector fields v, w in co&uc?, 
, . 3 
we define the connection V°&""" by means of 


: a 
Dw(v) = yo wt b*(v, w)N, 
. 3 ‘i : x3 . 3 
The co-Euclidean connection is the connection V- © induced on *E? by V°?&"", 


First, it should be clear that the co-Euclidean connection is preserved by the group 
of isometries of co-Euclidean space. We prove it now, and in fact this also follows 
from the characterization given in Proposition 5.3 below. 


. ; *p3 io. ; 
Lemma 5.2. The co-Euclidean connection V ©" is invariant for Isom(*E?). 


Proof. From the definition, we have 


Verse? Aw = D(Axw)(Aav) — b* (And, Axw)Nace) 
= A,(Dw(v) — b*(v, w)Nx) 


7 3 
= yore w 


’ 


since the ambient connection D and the vector field N are invariant for the action of 
Isom(*E°). 
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We will denote by T the vector field on co€uc? defined by (0, 1) in T,.co8uc? & 
T,,S* x R. It is a degenerate vector field invariant for the group Isom(co€uc?), as 
seen from the form inside the brackets of (15). Observe that T does not descend to a 
global vector field on *IE3, but we will still talk about the vector field T (by an abuse 
of notation) as the vector field induced on any simply connected open subset of * 3. 


Proposition 5.3. The connection VE is the unique connection on * IE? such that 
° ve is symmetric, i.e. 
Vee Y —V,e xX = [X,Y] 
for any pair of vector fields X,Y; 
2 VE is compatible with the degenerate metric g* of *E°, i.e. 
ZENS Ve, AES OGD 
for any vector fields X,Y, Z; 


“VE preserves every space-like plane of *1E°, i.e. for V, W vector fields on a 
space-like plane P, es W is tangent to P; 


* the vector field T is parallel with respect to VE ie VET =0. 


In particular, the restriction of V © to any space-like plane coincides with the Levi- 
Civita connection for the induced metric. 


Proof. It suffices to prove the statement for the double cover co€uc?, since clearly 
—Id acts on co€uc? as an isometry of co€uc?, and preserving yeoSuc Te is straight- 
forward to check that V°?&"<* defines a connection on co€uc?. Symmetry follows 
from the observation that Von y — veer is the tangential component of 
(DxY — DyX) = [X,Y], which equals [X, Y] itself. Also compatibility is very 
simple: for every vector Z tangent to co&uc?, 


Z.g*(X,Y) = e*(DzX,Y) + g*(X, DzY) 
= pry x, Y) a g(x. yo Y). 


For the third point, let P be a plane of co€uc? obtained as the intersection of co€uc? 
with a linear hyperplane P’ of R*. Given vector fields V, W on P, Dy W is tangent 
to P’ and thus the projection to co€uc? is still in P. Finally, it is clear from the 
construction that the derivative of T in any direction vanishes. 
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Let us now assume that the four conditions hold. In the coordinate system pro- 
vided by S? x R, the restriction of V° Buc} to every plane S? x {*} preserves the plane 
itself (by the third point) and coincides with the Levi-Civita connection of S”, by the 
second point. 

Hence it is easily seen that the Christoffel symbols rk are those of the Levi- 
Civita connection when 7, j,k correspond to coordinates of S?. Otherwise, using 
the first and fourth hypothesis, the rk vanish. Hence the connection is uniquely 
determined. 


Corollary 5.4. The geodesics for the co-Euclidean connection V'=* coincide with 
the lines of *E°. 


Proof. Again, we prove the statement for the double cover co€uc?. Given a space- 
like line 7 of co€uc?, using the action of Isom(co&uc?) we can assume that / is 
contained in the slice S? x {0}. Since the connection on such a slice coincides with 
the Levi-Civita connection, and lines of co€uc? are geodesics for this copy of 87, 
I is a geodesic for yeosue* If] is not space-like, then it is of the form {*} x R. 
Since Veo Suey = 0, it is clear by construction that / is geodesic, provided it is 
parametrized in such a way that its tangent vector is a fixed multiple of T for all time. 

Since there is a line of co€uc? through every point of co&uc? with every initial 
velocity, this shows that all geodesics for the connection V Suc} are lines of co€uc?. 


The volume form of co-Euclidean space. By means of the transverse vector field 
N in R4, we can also perform the usual construction to define a volume form for 
co-Euclidean space. Indeed, we can give the following definition: 


Definition 5.5 (co-Euclidean volume form). The volume form of co€uc? is the 
3-form 6,3 Such that, given vectors v, w,u in Tyco Euc?, 

WeoGuc3(U, W,U) = Opa(N, v, w,U). 
The co-Euclidean volume form is the volume form w«,3 induced on * IE? by 


coSuc3* 


Both the volume form of the ambient R‘ and the vector field N are invariant for 
Isom(*IE3), hence clearly: 


Lemma 5.6. The volume form @,9,,3 is invariant for the group Isom(*E°). 


Of course, as there is no Riemannian or pseudo-Riemannian metric invariant by 
the group of isometries of *IE? (Fact 3.11), there is no volume arising as the vol- 
ume associated to a metric. However, the volume form w,3 has the property that 
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@+—3(v, w,T) = | provided v, w are orthonormal space-like vectors for the degen- 
erate metric of *IE3, and T is the “unitary” degenerate vector field, so that the triple 
(v, w,T) is positively oriented. As in the classical case, since parallel transport pre- 
serves both the degenerate metric (this follows from the compatibility with the metric) 
and the vector field T (see Proposition 5.3), the volume form wj,3 has the following 
characterization: 


Proposition 5.7. The volume form W«,3 is the unique volume form on * IE? such that 
: “ *p3 * 3 
* W+,3 is parallel with respect toV “, i.e. V “ @x—3 = 0; 


* W«~3(v, W,T) = 1, where v, w,T is an oriented triple at a fixed point xo, such 
that v, w are orthonormal space-like vectors. 


The case of co-Minkowski space. Very similar constructions can be used to define 
a connection and a volume form for co-Minkowski space. Indeed, we define the 
vector field N, which is transverse to coMin? ~ KH? x R in the ambient space R‘, as 
the vector field Ny = x € R*. Clearly this definition is invariant by the isometries 
of coMin?, hence in particular by the involution which identifies the two connected 
components of 1? x R. For simplicity, we identify *IMin? ~ H? x R as one of 
the two connected components of H? x R. Recalling that the bilinear form for co- 
Minkowski space b* has the form xe + x5 — Xe in the (x1, X2, x3, X4)-coordinates, 
using the ambient connection and volume form of R4, one obtains the definitions of 
co-Minkowski connection and volume form: 


Definition 5.8 (co-Minkowski connection). Given two vector fields v, w in *Min?, 
2 * We m3 
we define the connection V-M™” by means of 


Dw(v) = V, Min? y, + b*(v,w)N. 


Definition 5.9 (co-Minkowski volume form). The volume form of *Mim? is the 3- 
form @. py; ,3 Such that, given vectors v, w,u in i Min?, 


Mxyin3 (VU, WU) = Opa(N, v, w,u). 


We report on the key properties below, without giving the proofs, as they are 
completely analogous to the co-Euclidean case. 


* Te om D 
Lemma 5.10. The co-Minkowski connection V™'™ and volume form @eo 64.3 Ave 
invariant under the action of the isometry group Isom(* Min?). 
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Here we denote by T the normalized degenerate vector field (as we recall that 
there is a notion of length of the degenerate direction which is preserved by the group 
of isometries of *Mim?). In the (x1, x2, x3, x4)-coordinates, T can be written as 
(1, 0, 0, 0). 


Proposition 5.11. The connection V™™” is the unique connection on * Min? such 
that 


*Mim? - ei i 
¢ V Min” is symmetric, i.e. 


V,Min’ y = Vin? x = LX, Y] 
for any pair of vector fields X,Y; 


* INT on 3 x 7 A ° : 
« V Mim” is compatible with the degenerate metric g* of *Mim%, i.e. 


vy" Min* o* — 0; 


. V* Min? preserves every space-like plane of *Mim?, ie. for V, W vector fields 


ona space-like plane P, Vv) Min° W is tangent to P; 
* the vector field T is parallel with respect to v*Min® i.e, V'Min’T = 0, 


Corollary 5.12. Geodesics for the co-Minkowski connection Vv" Min* coincide with 
lines of *IMin?. 


Proposition 5.13. The volume form Oxy4;,,3 18 the unique volume form on *Min? 
such that 


. . *Min? . * Min? _—0- 
° Mxyin3 48 parallel with respect to V , te V Oxnin3 = 9: 


© Mxyim3 (VU, W,T) = 1, where v, w,T is an oriented triple at a fixed point xo, 
such that v, w are orthonormal space-like vectors. 


Relation with geometric transition. We will now show that the connections of co- 
Euclidean and co-Minkowski space are natural also in the sense of geometric transi- 
tion, that is, they are the limits of the Levi-Civita connections under the transitions 
described in Section 4.1 and 4.2. We will focus on dimension 3 for definiteness, and 
consider the transition of Ell? and d$? to *IE3, as described in Scheme (24), which 
makes use of the projective transformations 


100 0 

010 0 
g*=|0 0 1 0] €PGL(4,R). (28) 

1 

00 0 = | 
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Proposition 5.14. Let X;, Y;, Z; be smooth families of smooth vector fields on EII? 
(resp. dS*) such that Xo, Yo, Zo are tangent to a plane P. The limit of g; X; isa 
vector field on *E3 which we denote by X, and analogously for Y and Z. Then, 


Bed * 3. * 3 3 * D3 x 
* the limit of g; ve" Y (resp. g; ve Y) is var Y; 
© the limit of o™™ (X;, ¥;, Z:) (resp. wo (Xz, Y:, Zt)) is o (X,Y, Z). 


Proof. First, let us assume that P is defined by x4 = 0 in RP®, as in Section 4.2, 
so that we can use the projective transformations in (28). We can as usual perform 
the computation in the double cover co€uc?. Hence in those coordinates, the vector 
field Xo, Yo, Zo have x4-coordinate equal to 0. Hence, the limit of g7 X; is 


(Xi(x))1 (Xo(x))1 
_ | HrO)2 (Xo(x))2 
lim g¢ Xe(x) = lim | (X03 | = | (vaca) 
t> ie iy : 
(LiO)a (X(x))4 

t 


namely, it is a vector field on *IE* whose vertical projection is identified with the 
vector field Xo (x). 
Now, observe that g* maps 8? to 


$3 := g* (83) = {x | x? 2x2 4.43 + 7x? = 1}. 


Moreover, g* is an isometry if we endow S? with the metric induced by the quadratic 
form 


bi g(x, x) =xptxp 4x3 4 07x7. 


Hence g* maps the normal vector N(x) = x of 8? to N;(x) = g*(x), which is 
precisely the normal vector to 8} for bi, o- 

Now, as Xo, Yo, Zo are tangent to P, their base point x; are rescaled to gfx; 
which converge tO X99, base point for X ; y ; Z. For the same reason, the normal 
vector N; (x1) = g7 (xr) to 8} at Xz converges tO Xoo, which is (in the notation of 
Definitions 5.1 and 5.8) the vector N(x90) used as a transverse vector field to co€uc?. 

Hence in conclusion, 

ySy _ VS? xy _ t 
St Vy, Y; = Verx, 8 Y, = DY;(X;) — ba (Xt, Y;)Nz (xz) 
converges to 
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Analogously, 
Wg2(X;, Y;, Zt) = Wpa(N, Xt, Y;, Zt) me op (Ne, ge Xr, ae oe g Zr) 


converges to 
co Euc3 (X, Y, Z) = Wp4(N(Xoo), x, y; Z). 


Of course, the proof is completely analogous for the rescaling from d$?. 


Actually, the proof is similar for the cases of geometric transitions of H? and 
Ad? to *Mim?, hence one can prove: 


Proposition 5.15. Let X;, Y;, Z; be smooth families of smooth vector fields on 1? 
(resp. AdS*) such that Xo, Yo, Zo are tangent to a plane P. The limit of gF Xt isa 
vector field on *IMim? which we denote by X, and analogously for Y and Z. Then, 


Seer, 3 3 b «Mem vt 
¢ the limit of gf Vi Y (resp. eves Y) is yous Y: 


© the limit of o (X,,Y;, Zt) (resp. o®4® (X;, Y;, Z;)) is o Mim’ (X, Y, Z). 


5.3 The infinitesimal Pogorelov map 


Weyl formula for connections. In an affine chart, a (non-degenerate) model space 
IM has the strong property that its (unparametrized) geodesics are the same as in the 
ambient IR” endowed with a Euclidean metric. This will imply that, in an open set, 
the Levi-Civita connection V of IM can be written in terms of the usual connection on 
IR”. The manifold IM may not be orientable, but on an affine chart we can consider the 
same orientation as IR”. Hence the pseudo-Riemannian metric of IM gives a volume 
form in the affine chart. 

Let N be an orientable manifold with two torsion-free connections V and V and 
with two volume forms w and @ such that Vo = 0 and VO = 0. Let A be the 
function on N such that 


O=do. 


Let us introduce 
D(X, Y) = VyY¥ —VyY. 


This function is linear over smooth functions on both argument, so it is a (1, 2)-tensor. 
As the connections are symmetric, it is easy to check that D is symmetric. 
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Lemma 5.16. The contraction of D is a \-form that is given by i: 
CD =d Ini, (29) 


Proof. Recall that the volume form @ is parallel for V (26), that gives, in the given 
local coordinates on N, 


n n 
CHO o( » Tk ex, weds en) feet o(er, ety >» Ther): 
k=1 k=1 
But @ is alternating and n-linear: 
n 
IiO1,..n = OTH C1,---5€n) Hee + O(C1,- 0. Men) = (> PH) Orman 
a=1 


namely 
n 
Yo T% = 4 In(r,....n) 
a=1 


and in particular, 


n n 
ee ee ee 
a=1 a=1 


Lemma 5.17 (Weyl Formula). Let us suppose that both V and V have the same 
(unparameterized) geodesics. Then 


Vy ¥ —Vy¥ = Xn AT#T)Y + YIN AMT), 


Proof. Observe that in this formula both sides are linear in X and Y. To compute the 
left-hand side, fix p and X, € T,N and let c: 1 — N be a parameterized geodesic 
for V with tangent vector X, at c(0) = p. Extend X, to X, the tangent vector of 
c, which is parallel along c, so that Vy X = 0. Let X be the tangent vector of a 
reparametrization of c that turns it into a geodesic ¢ for Vv (hence Vv eX = 0). Then 


there is a function f on c (which depends on the choice of Xp and Xx p) Such that 
X = fX. We compute 


(Vy X — Vy X)(p) = (Vx X)(p) 
= Veep ef X) 
= f(p)(Xp.f)Xp + f(pP Vg, Xp 


= f(pyCpt) Xo: 
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so D(X, X) = X.In(f)X. Let us define (X) = X.In( fx)/2, so that 
D(X, X) = 26(X)X. (30) 


Let us show that ¢ is a 1-form. In fact, given any symmetric tensor D such that 
D(X, X) is a multiple of X, the function ¢ satisfying (30) is uniquely determined 
and clearly satisfies 6A X) = 1¢(X). For the additivity, from the symmetry of D, 
we have 


D(X + Y,X+ Y) = D(X, X)4+ DIY, Y) + 2D(X,Y), (31) 


hence 
P(X +YVY)\(X+ VY) = o(X)X + d(Y)Y + D(X, Y). 


On the other hand, developing the same expression for D(X — Y, X — Y) gives 
o(X —Y)\(X —Y) = @(X)X + o(Y)Y — D(X, Y). 
Putting together the two expressions, one obtains 
(P(X + Y) + G(X —Y)—26(X))X = (G(X —Y)— b(X + ¥) + 26(Y))Y. 
Since it now suffices to consider X and Y linearly independent, one has 
P(X +Y) + P(X —Y)—29(X) = G(X —Y)— G(X + Y) + 26(Y) = 0 


and therefore 6(X + Y) = $(X)+¢(Y). Hence, using the linearity of @ which was 
just proved, from (30) and (31) one obtains 


D(X,Y) = b(X)¥ + o(Y)X. 


Finally, contracting on both side using (29) leads tod Ind = (n + 1). 


Killing fields. If V is the Levi-Civita connection of a pseudo-Riemannian metric g 
on a manifold N, a vector field is called a Killing field if it generates a 1-parameter 
group of isometries. A vector field K is a Killing field if and only if Lyg = 0 
(Z£ is the Lie derivative), or if and only if VK is a skew-symmetric (1, 1)-tensor: 


g(Vx K,Y)+ g(X, Vy K) = 0, (32) 


for every X,Y. It is easy to see that this condition is actually equivalent to the con- 
dition that g(Vx K, X) = 0 for every X. 
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The infinitesimal Pogorelov map. Let us suppose that V is also a Levi-Civita 
connection for a pseudo-Riemannian metric g on N. Let L := Lg g be the map 
TN — TN defined by 


g(X,Y) = a(L(X),Y) (33) 
for every Y. The infinitesimal Pogorelov map 

P:=Pgg:TN —>TN 
is defined by 


P(X) = Aw#T L(X). 


Lemma 5.18. The infinitesimal Pogorelov map Pg,g sends Killing fields of g to 
Killing fields of g: if K is a Killing field of (N, g), then P(K) is a Killing field of 
(N, 8). 


Proof. By definition, 


2(P(K), X) = g(A7#TK,X), (34) 


which implies 


X.8(P(K), X) = X.g(A™F K, X), 
and using the fact that K is a Killing field of (N, g), we arrive at 


g(Vx P(K), X) = (X.ATFT)g(K, X) — gATTK, Vy X —VyX). (35) 
Observe that by Weyl formula (Lemma 5.17), 
g(ATFT K, Vy X — VyX) = 2 (ATT K, 2X. (In ATT) X)) = (X.AT1)8(K, X). 


Hence from equation (35) one gets &(Vx P(K), X) = 0, i.e. P(K) is a Killing field 
of (M, g). 


The infinitesimal Pogorelov map from hyperbolic to Euclidean space. Let us 
consider the usual Klein model of the hyperbolic space H”, i.e. the affine chart 
{Xn+1 = 1}, in which H” is the open unit ball 


B” = {x € R” | bao < 1}. 


We will compare the hyperbolic metric g on B” with the standard Euclidean met- 
ric. For x € B”, the point r(x) := p(x) Gy with 


1 
pO) = a/1 — bn,o(x, x) 
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belongs to H” in Mim”*!.!7 So if X € T, B”, then 


P(X)*Bno(x, X)x + 2 


dr(X) = ptx( p(x)*Bn,o(x. X) 


belongs to T,(,)H{", and it is then straightforward that the expression of the hyper- 
bolic metric in the Klein model is 


&x(X,¥) = p(x)*bno(X, Y) + p(x)*bn,o(x, X bn o(x, Y). (36) 


The radial direction at x is the direction defined by the origin of IR” and x. 
From (36), a vector is orthogonal to the radial direction for the Euclidean metric 
if and only if it is orthogonal for the hyperbolic metric. A vector orthogonal to the 
radial direction is called lateral. Then for a tangent vector X of B” at a point x, 


¢ if X is radial, then its hyperbolic norm is p(x)? times its Euclidean norm. 
¢ if X is lateral, then its hyperbolic norm is p(x) times its Euclidean norm. 
With respect to the definition of L = Lg, , in (33), 

Lx(X) = p(x)?X + p(x)*bn,o(x, X)x 


and a lateral (resp. radial) vector is an eigenvector for L with eigenvalue p(x)? (resp. 
p(x)*). In an orthonormal basis for by, det(gx) = det(Lx) = p(x)?*, so 


—(n+1) 


OR” = P On”. 


Let K be a Killing field of H”. Then by Lemma 5.18, P(K) = p~7L.(K) is a 
Euclidean Killing field. More precisely, 


P(K), = Ky + (x)7bn,o(x, Ky)x. 
In particular, 


* if K is lateral, the Euclidean norm of P(K) is equal to p~! times the hyperbolic 
norm of K; 


¢ if K is radial, the Euclidean norm of P(X) is equal to the hyperbolic norm of K. 


17 Tf t is the hyperbolic distance from r(0) to r(x), then p(x) = cosht. 
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The infinitesimal Pogorelov map from anti-de Sitter to Minkowski space. Let us 
consider the model of anti-de Sitter space Ad” given by the affine chart {x,4;=1}. 
Recall that AdS” is the projective quotient of 


Ads" = {(X, Xn41) E Re | bn—-1,2 = —1} 
and its image in the affine chart is 
H"= {x € R” | bn-1,1 < 1}. 


We will compare the anti-de Sitter Lorentzian metric g on H” with the Minkowski 
metric on R”. For x € H”, the point p(x)(7) with 


1 


px) = ———$——_— 
Vv 1- bn—1,1 (x, x) 
belongs to AdS”. A computation similar to the hyperbolic/Euclidean case gives 
&x(X,Y) = p(x)*bn—1,1(X, Y) + p(x)*bn-1,1%, X)bn-1,1 (x, Y). 


As for the Klein model of the hyperbolic space, a vector has a radial and a lateral 
component, and this decomposition does not depend on the metric. We also have 


Lx(X) = p(x)?X + p(x)*bn-11(%, X)x, 
whose determinant in an orthonormal basis for b,—1,1, is p(x)2%+), so (recall (27)) 


ORn = oer 1) 


Aas". 
Let K be a Killing field of AdS”. Then by Lemma 5.18, 
p 7 L.(K) = K + p?bn-11(, K)- 


is a Minkowski Killing field. 


Comments and references 


¢ A symmetric connection such that there exists (locally) a parallel volume form 
is characterized by the fact that its Ricci tensor is symmetric, [48, Proposi- 
tion 3.1]. 
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¢ Let V be a torsion-free complete connection on a manifold N, such that a local 
parallel volume form exists. An affine field of V is a vector field that generates 
a l-parameter group of transformations that preserves the connection. Let R 
be the curvature tensor of V: 


R(X, Y)Z = VxVy Z — Vy VxZ — Vix,y]Z. 
Then K is an affine field if and only if (see e.g. [41]) 
Vy Vx K = —R(K,Y)X. 


Let V be another connection on N with the same properties as V, both having 
the same unparametrized geodesics. Let R be its curvature tensor. Then a direct 
computation using the Wey! formula shows that 


RIX, Y)Z = RIX, YNZ + Vf Hd fC DY —("7 Ff 4 FC. ZX, 


where V7? is the Hessian for V. See pp. 126f in [41] for the relations between 
infinitesimal affine transformations and Killing fields. 


Lemma 5.18 was proved in [40] and independently in [68]. See also the end of 
Section 6.5. 


The simplest infinitesimal Pogorelov map is the one from Minkowski space 
to Euclidean space: it suffices to multiply the last coordinate of the Killing 
field by —1. This was noted in [34]. The term infinitesimal Pogorelov map 
comes from the fact that it was defined as a first-order version of the so-called 
Pogorelov mapping, see the end of Section 6.5. See [45], [58], and [30] for 
some applications. 


6 Geometry of surfaces in 3-dimensional spaces 


The purpose of this section is the study of embedded surfaces, with particular at- 
tention to convex surfaces, in 3-dimensional model spaces. We will first review the 
classical theory of embeddings of surfaces in 3-manifolds, with special attention to 
the constant curvature cases we have introduced so far. After that, we will define 
analogous notions in the degenerate spaces (for instance, the second fundamental 
form and the shape operator), in particular co-Euclidean and co-Minkowski spaces, 
and show that these notions have a good behavior with respect to both the geometry 
of *IE? and *Mimn?, although the ambient metric is a degenerate metric, and with 
respect to duality and geometric transition. 
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6.1 Surfaces in non degenerate constant curvature 3-manifolds 


Let (M, g) be a Riemannian or pseudo-Riemannian manifold. Given a smooth im- 
mersion 0: S — M with image a space-like surface o(S) in (M, g), recall that the 
first fundamental form is the pull-back of the induced metric, namely 


I(v, w) = g(ox(v), ox(w)). 


The Levi-Civita connection V* of the first fundamental form I of S is obtained from 
the Levi-Civita connection of the ambient manifold: given vector fields v, w on S, 
vy w is the orthogonal projection to the tangent space of S of ve y(OxW). 


Let us denote by N a unit normal vector field on S, namely N is a smooth vector 
field such that for every point x ¢ S, Nx is orthogonal to 7,(,)o(S), and such that 
|g(N, N)| = 1. The second fundamental form II is a bilinear form on S defined by 


VM (o4t) = Vow + Hv, w)N (37) 


where w denotes any smooth vector field on S extending the vector w. This form 
turns out to be symmetric and it only depends on the vectors v and w, not on the 
extension of any of them. The shape operator of S is the (1, 1)-tensor B € End(TS), 
self-adjoint with respect to I, such that 


II(v, w) = I(B(v), w) = I(v, B(w)). (38) 
By a standard computation, it turns out that 
Biv) = +V™N, (39) 


the sign depending on whether M is Riemannian or Lorentzian, where we have used 
the differential of the embedding o to identify a vector v € T,S to a vector tangent 
to the embedded surface o(S). Indeed, by applying the condition of compatibility of 
the metric of the Levi-Civita connection to the condition |g(N, N)| = 1, it is easily 
checked that VN is orthogonal to N, hence is tangent to a(S). Since B is self- 
adjoint with respect to I, it is diagonalizable at every point. The eigenvalues of B are 
called principal curvatures. 

The extrinsic curvature is the determinant of the shape operator, i.e. the product 
of the principal curvatures. The mean curvature is the trace of the shape operator, 
that is the sum of the principal curvatures. 
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Fundamental theorem of immersed surfaces. The embedding data of a smooth 
immersed surface in a 3-manifold are usually considered as the first fundamental 
form and the shape operator, or equivalently, the first and second fundamental forms. 
However, these two objects are not independent and satisfy some coupled differen- 
tial equations usually called the Gauss—Codazzi equations. We start by expressing 
such equations in the setting of the constant curvature 3-manifolds introduced above, 
namely Euclidean and Minkowski space, and the model spaces. 

The Codazzi equation can be expressed in the same fashion for all ambient spaces, 
and it says that the exterior derivative of the shape operator B, with respect to the 
Levi-Civita connection of the first fundamental form (denoted by V'), vanishes. In 
formulae, 


d™'B := (V!B)(w) — (V1,B)(v) = VEB(w) — VE, B(v) — Blv, w] = 0. (40) 


The Gauss equation is a relation between the intrinsic curvature of the first fun- 
damental form, and the extrinsic curvature of the immersion. The form of the Gauss 
equation, however, depends on the ambient metric (whether it is Riemannian or 
pseudo-Riemannian) and on its curvature. In particular, if Ky denotes the curvature 
of the first fundamental form, in Euclidean space the Gauss formula is 


Ky, = det B, 
while in Minkowski space, for immersed space-like surfaces, the correct form is 
Ky = —det B. 


As we said, if the ambient manifold has nonzero curvature, there is an additional term 
in the equation. For instance, for 8? or Ell? (curvature 1 everywhere), 


K, = 1+ det B, 
while for surfaces in hyperbolic space we have 
Ky = —1+ det B. 


Finally for space-like surfaces in Lorentzian manifolds of nonzero constant curvature: 
in de Sitter space the Gauss equation is 


K, = 1—det B, 
and in anti-de Sitter space of course 


K, = —1—det B. 
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We summarize all the statements in the following: 


Fact 6.1. Given a smooth immersed surface in a three-dimensional model space 
(or in Euclidean/Minkowski space), the first fundamental form and the shape op- 
erator satisfy the corresponding Gauss—Codazzi equations. 


Clearly, if one post-composes a smooth immersion with an isometry of the ambi- 
ent space, the embedding data remain unchanged. A classical theorem in Euclidean 
space, which can be extended to all the other cases of constant curvature, states that 
the Gauss—Codazzi equations are necessary and sufficient conditions in order to have 
the embedding data of a smooth surface, and the embedding data determine the sur- 
face up to global isometry. See [7] and [63] for a reference. 


Theorem 6.2 (fundamental theorem of immersed surfaces). Given a simply con- 
nected surface S and a pair (1, B) of a Riemannian metric and a symmetric (1, 1)- 
tensor on S, if (1, B) satisfy the Gauss—Codazzi equations in Euclidean (resp. Min- 
kowski, elliptic, hyperbolic, de Sitter or anti-de Sitter) space, then there exists a 
smooth immersion of S having (1, B) as embedding data. Any two such immersions 
differ by post-composition by a global isometry. 


Duality for smooth surfaces. We can use the description of duality introduced in 
Section 2.4 to talk about duality for convex surfaces. In fact, consider a surface S, 
which is the boundary of a convex set. Then its dual is again a convex set with 
boundary a surface S*. For instance, the dual of the boundary of a convex set in H? 
is a space-like surface in dS? (and vice versa). Similarly one can consider a convex 
set in Ad$?, whose boundary is composed of two space-like surfaces, and dually one 
obtains a convex set in Ad$? bounded by two space-like surfaces. 

Let us consider a convex set with boundary a smooth (or at least C 2) embedded 
surface S, such that B is positive definite at every point (which in particular implies 
strict convexity). The third fundamental form of S is 


Il(v, w) = 1(B(v), B(w)). 


First, observe that, since B is positive definite, III is a Riemannian metric. The 
reader can check, as an exercise, that in his/her favorite duality of ambient spaces, 
(III, B~') are the embedding data of a space-like surface, namely they satisfy the 
Gauss—Codazzi equations provided (I, B) satisfy the Gauss—Codazzi equations. It is 
helpful to use the following formula for the curvature of II(v, w) = I(B-, B-), 


Ky 
det B’ 


which holds under the assumption that B satisfies the Codazzi equation (see [44] 
or [43]). 


Kin == 
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Fact 6.3. The pair (III, B~') are the embedding data of the dual surface S*. 


Let us check this fact in the hyperbolic-de Sitter case. If S is a convex surface in 
H?, for x € S, N(x) is a point inside the unit tangent sphere of 7,,H3, and III is by 
definition the pull-back by N on S of the spherical metric. But on the other hand, the 
hyperplane tangent to S at x is naturally identified, via the double cover of Min‘, toa 
time-like hyperplane, and N(x) is a unit vector orthogonal to this hyperplane, hence a 
point in dS”, the double cover of de Sitter space. So III is exactly the induced metric 
on the dual surface, and by the involution property of the duality, its shape operator 
is the inverse of the one of B. 


6.2 Geometry of surfaces in co-Euclidean space 


We are now ready to define the second fundamental form of any space-like surface 
in *E?. For simplicity (since this is a local notion) we will use the double cover 
co€uc? & §? x R. A space-like surface in co€uc? is locally the graph of a function 
u:Q —> R, for Q C S?, and the first fundamental form is just the spherical metric 
on the base 8?. 

Given a space-like immersion o: S > co€uc?, again there is no notion of normal 
vector field, since the metric of co€uc? is degenerate. However, we have already 
remarked several times that the vector field T (of zero length for the degenerate metric, 
see Section 5.2) is well-defined, i.e. it is invariant by the action of the isometry group. 
Given two vector fields 6, ® on S, using symmetry and compatibility with the metric, 
it is easy to prove that the tangential component of Veet (ox W) in the splitting 


Tix1)cobuc? = Tx,1)0(S) ® (T), 


coincides with the Levi-Civita connection of the first fundamental form. This enables 
to give the following definition: 


Definition 6.4. Given a space-like immersion 0: S + co&uc?, the second funda- 
mental form of S is defined by 


co Guc3 A A 
V (o.6) = V, (0) + Mv, w)T, 


OxU 
for every pair of vectors v, w € TS, where wW is any extension on S of the vector 


weT,S. 


It is easy to check, as in the classical Riemannian case, that II is linear in both 
arguments, and thus defines a (0, 2)-tensor. 
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Definition 6.5. The shape operator of 0: S + co€uc? is the symmetric (1, 1)-tensor 
B such that I(v, w) = I(B(v), w) for every v, w € 7S, where I is the first fun- 
damental form of S'. The extrinsic curvature of S is the determinant of the shape 
operator. 


As usual, the definition does not depend on the extension of any of the vectors v 
and w and the second fundamental form is symmetric. 


Lemma 6.6. Given a space-like embedded graph S in co€uc?, consider the embed- 
ding 0: — co&uc? & §? x R defining S as a graph: 


a(x) = (x, u(x)) 
for u:Q — Rand Q C 8?. Then the shape operator of S for the embedding o is 
B= Hess* u + uld, 
where Hess®'u = VS" grad u denotes the spherical Hessian of u. 


Before proceeding to the proof of this proposition, let us relate functions on Q2 C 
S? C E? to functions on an open set of E>. As the unit outward vector field of S? is 
the identity, it follows from (37) and (39) that, at a point x € 8”, for X,Y tangent to 
the sphere, 


DyY =VS-Y —(X,Y)x (41) 
where (-,-) := 53,9. From this it follows immediately that, for U the 1-homogenous 
extension of u, 

(Hess U(X), Y) = X.Y.U — (Dx Y).U 
= Hess® U|g2(X, Y) + (X, ¥)x.U, 
and as U is 1-homogenous, by the Euler theorem, x.U = U, so on 8? 


Hess U = Hess®* u + uld. (42) 


Proof. Fix a point x9 € Q, and denote by b* = x,y, + x2y2 + x33 the ambient 
degenerate quadratic form. By composing with an element in Isom(co€uc’) of the 
form (x,t) +> (x,t + f(x)), where f(x) = (x, po), we can assume S is tangent 
to the horizontal plane S? x {0} at xo. Indeed, observe that by the usual duality, this 
statement is equivalent to saying that the point in E> dual to Ty (x,)S is the origin and 
the plane in E? dual to x9 is an Such a condition can be easily obtained by applying 
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a translation of E?. From (42), Hess®” f + fid = 0, and thus it suffices to prove 
the statement in this case. 

We consider S? x {0} inside the copy of IE obtained as {x4 = 0} in R*. Let 6, 
W be two vector fields on Q C S?. Then at xo € S? one has as in (41) 


fe Be yay ck 
Dsw = y w — (0, W)Xxo. 


Consider now the vector fields o,(¢) = 6 + du(0)T and o,(w) = w+ du(w)T 
on S. We choose extensions V and W in a neighbourhood of S which are invariant 
by translations ¢ +> f + fg in the degenerate direction of 8? x R. We can now compute 


DyW = Dsw + Dsg(du(W)T) + du(t) DrW 
= Ved — (6, %) x0 + 0.(grad u, )T 
= V8" — (8, ) xo + ((Hess® u(d), ) + (gradu, VE w))T, 
where in the first equality we have substituted the expressions for V and W, and in 
the second equality we have used that DT = 0 and that the chosen extension W is 
invariant along the direction of T. Recalling that the connection ve W at Xo is 


defined as the tangential component of Dy W with respect to the transverse vector 
Nxo = Xo and using that grad u vanishes at xo by construction, we get at x9 


Veen W = VS" + I(Hess* u(v), w). 


By Proposition 5.3, Ve is tangent to the slice $7 x {0} itself, and thus the second 
fundamental form is 5 
II(v, w) = I(Hess* u(v), w). 


Since u(x) = 0, this concludes the proof. 
Corollary 6.7. A space-like surface in * I? is a plane if and only if B = 0. 


Proof. Totally geodesic planes in co€uc? are graphs of functions of the form u(x) = 
es 

(x, p), which are functions such that Hess* u + uId vanishes. Since the statement 

is local, this is true also in *E?. 


Recall that the induced metric on a space-like surface S in co€uc?, written as a 
graph over Q C 87, is just the spherical metric of S?. Hence the first fundamental 
form I of S satisfies 

Ki = 1. (43) 


3 as well. Moreover, it turns 


out that the shape operator satisfies the Codazzi equation with respect to I: 


Of course this statement is local, and thus holds in *E 


dV B=O. (44) 
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Indeed, let U be the 1-homogenous extension of u. Let B = HessU. Each 

component Bi, i = 1,2,3,is a 1-form on R3, 

Bi = Bi dx! + Bax” + Bi dx’, 
with Uj; = BE, By the Schwarz lemma, Bis closed, and by (40) and (41), B satisfies 
the Codazzi equation on 8”. See also [35, 2.61] for the relations between closedness 
of 1-forms and the Codazzi equation. 

The above two equations (43) and (44) can be interpreted as a baby-version of 
the Gauss—Codazzi equations for co-Euclidean geometry. Of course this is a very 
simple version, since the two equations are not really coupled: the first equation is 
independent of B. In fact, the proof of the fundamental theorem of immersions will 
be very simple. But before that, let us show that any tensor satisfying the Codazzi 
equation on S? can be locally written as Hess® u + uld. 

Let Q be a domain in 8? and let Q be the cone over Q from the origin of E?. Note 
that Q is star-shaped from the origin. Let B be a (1, 1)-tensor on Q. Let Bbea di, 1)- 
tensor of Q which satisfies By (x) = 0, and for (X,x) = 0, B,(X) = Bx six (X), 
where || - || is the usual norm on E?. 

Suppose that B satisfies the Codazzi equation on &2, and consider B as an 
R3-valued 1-form. From (40) and (41), B satisfies the Codazzi equation on Q, hence 
is a closed form, and by the Poincaré Lemma, there exists F’ ?-R3 — R such that 
Bi = dF’, Suppose moreover that B is symmetric. This readily implies that B is 
symmetric, and hence that the 1-form w = Fidx'!+4+ Fd ae + F3d te is closed, 
so by the Poincaré Lemma, there exists a function U: Q > Rwithw = dU , Le. 
B = Hess U. From (42), we obtain the following fact, previously noted by D. Ferus 
in [12]: 


Fact 6.8. Let B be a tensor satisfying the Codazzi equation on a surface of constant 
curvature equal to +1. Then locally there exists a function u on 8 such that B = 
Hess®* u + u Id. 


Proposition 6.9 (Fundamental theorem of immersed surfaces in co-Euclidean geom- 
etry). Given a simply connected surface S and a pair (1, B) of a Riemannian metric 
and a symmetric (1, 1)-tensor on S, if (I, B) are such that 1 has constant curvature 
“li 


kK, = 1, 
and B is satisfies the Codazzi conditions for I, 
dB =0. 


Then there exists a smooth immersion of S in *IE> having (1, B) as embedding data. 
Any two such immersions differ by post-composition by a global isometry of *I°. 
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Proof. Again, it suffices to prove first the immersion statement in co€uc?. By Fact 6.8 
there exists a function u: S > R such that B = Hess® wu + uld. Let dev: § + 8? 
be a developing map (that is, a local isometry) for the spherical metric Ion S. Then 
we define 

a: S —> coSuc? = 8* xR 


by means of 

o(x) = (dev(x), u(x)). 
Since dev is a local isometry, and the degenerate metric of co&uc? reduces to the 
spherical metric on the S? component, the first fundamental form of o is I. By 
Lemma 6.6, the shape operator is B. 

Given any other immersion o’ with embedding data (I, B), the projection to the 
first component is a local isometry, hence it differs from dev by postcomposition by 
a global isometry A of 87. By composing with an isometry of co€uc? which acts 
on 8? by means of A and leaves the coordinate ¢ invariant, we can assume the first 
component of o and o’ in S* x R coincide. 

Hence we have o’(x) = (dev(x), u/(x)) where uw’ is such that B = Hess®* u! + 
u! Id. Therefore Hess*” (u —u’) + (u —u’)Id = 0, which implies 


u(x) —u'(x) = (dev(x), po) 
for some po € E3. This shows that o and o’ differ by the isometry 


(x,t) > (x,t + (x, Po)) 


which is an element of Isom(co&uc*). This concludes the proof. 


Let us remark that (recalling the discussion of Section 3.3) if S is a space-like 
surface which is homotopic to $? x {«} in coGuc? & 8? x R and its shape operator B 
is positive definite, then S is the graph of u where u is a support function of a convex 
body whose boundary is S. Let us now compute the embedding data in Euclidean 
space of the dual surface of S. The formulae we will obtain are exactly the same as 
in the Ad$?-A dS? duality and in the H?-d$? duality. 


Corollary 6.10. Given a space-like smooth surface S in? (resp. * IE) which bounds 
a convex set and with positive definite shape operator, if the embedding data of S are 
(I, B), then the boundary of the dual convex set in *IE> (resp. I?) has embedding 
data (Ill, B~'). 


Proof. If the result is proved for one case, then the other case is obvious, since it 
suffices to exchange the roles of (I, B) and (III, B~!). Let us consider a surface S in 
E>, and let S* be the dual surface in *E?. 
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Let us assume that in the double cover co&uc?, S* is locally the graph of u over 
an open subset Q of S?. To check that the first fundamental form of the dual surface 
is the third fundamental form, it suffices to compute the pull-back of the spherical 
metric of S? by means of the first component of the dual embedding o: S — *IE? for 
S*; that is, the map which associates to a tangent plane P = 7), to S the dual point 
P* ¢*IE%. Again in the double cover, the first component of the dual embedding is 
precisely the Gauss map G of S, and the derivative of the Gauss map is the shape 
operator, hence one gets 


(do(v),do(w)) = (dG(v),dG(w)) = I(B(v), B(w)). 


Moreover, one can directly show that the inverse of the shape operator of S', by means 
of the inverse of the Gauss map Ghis+S cE ,isB! = Hess® u + uld, 
where u:S? — R is the support function of the convex set bounded by S. Using 
Lemma 6.6, this concludes the proof. 


6.3 Geometry of surfaces in co-Minkowski space 


By means of a very analogous construction, one can study the geometry of surfaces 
in co-Minkowski space. We will not repeat all the details, but just state the parallel 
results. Here (-,-) := 62,1 will denote the Minkowski product of Min?, and we will 
implicitly identify * Mim?, which is the quotient of coMin? ~ 3? x R by the action 
of {+ Id}, to one of its connected components, thus considering the model of H? x R. 
The definition of second fundamental form of an immersion o: S — * Min? makes 
again use of the splitting: 


T,coMin? = T,o(S) ® (T). 


Definition 6.11. Given a space-like immersion 0: S > *Min?, the second funda- 
mental form of S is defined by 


* IN 3 mn 3 x & 
V, him’ (ont) = VIC) + Hv, w)T, 


OxvU 


for every pair of vectors v, w € 7T,S, where wW is any extension on S of the vector 
weT,S. 


Definition 6.12. The shape operator of o: S —> *Mim? is the symmetric (1, 1)- 
tensor such that I(v, w) = I(B(v), w) for every v, w € T,S, where I is the first 
fundamental form of S. The extrinsic curvature of S is the determinant of the shape 
operator. 
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Lemma 6.13. Given a space-like embedded graph S in *IMim?, consider the embed- 
ding 0: Q — *Min? ~ H? x R defining S as a graph: 
a(x) = (x, u(x)) 
for u:Q — Rand Q C HH. Then the shape operator of S for the embedding o is 
B= Hess!” u—uld, 
where Hess y = V¥" grad u denotes the hyperbolic Hessian of u. 


Corollary 6.14. A space-like surface in *Min? is a plane if and only if B = 0. 


Again, one can observe that the embedding data of a smooth space-like surface 
in * Min? satisfy the very simple condition that the first fundamental form I is hyper- 
bolic, and the shape operator satisfies the Codazzi equation for I. 


Proposition 6.15 (fundamental theorem of immersed surfaces in co-Minkowski ge- 
ometry). Given a simply connected surface S and a pair (I, B) of a Riemannian 
metric and a symmetric (1, 1)-tensor on S, if (I, B) are such that | has constant cur- 
vature —1, that is, 


Kkj=—l, 
and B satisfies the Codazzi conditions for I, that is, 
dB =0. 


then there exists a smooth immersion of S in *Min? having (I, B) as embedding 
data. Any two such immersions differ by post-composition by a global isometry of 
*Min?. 


Finally, the formulae for the embedding data of dual surfaces work well in the 
Min?-* Min? duality: 


Corollary 6.16. Given a space-like smooth surface S in Min? (resp. *Mim*) which 
bounds a convex set and with positive definite shape operator, if the embedding data 
of S are (I, B), then the boundary of the dual convex set in * Min? (resp. Mim?) has 
embedding data (III, B~'). 


6.4 Geometric transition of surfaces 


We now conclude by showing that the notion of curvature in co-Euclidean and co- 
Minkowski geometry, under the procedures of geometric transition we have encoun- 
tered, is the rescaled limit of the usual notions in the model spaces which degenerate 
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to *IE? and *IMin?. In particular, we will focus on the transitions defined in Sec- 
tion 4.2, having limit in co-Euclidean and co-Minkowski spaces. 


Proposition 6.17. Suppose o; is a C? family of space-like smooth immersions of a 
simply connected surface S into EI or d$3 (resp. AdS? or 1H3), such that oo is 
contained in a totally geodesic plane P. Let 


— i * 
o= lim (g; © 04) 


be the rescaled immersion in * 5° (resp. * Mim?) obtained by blowing-up the plane P.. 
Then, 


* the first fundamental form of o coincides with the first fundamental form of 00: 


I(v,w) = lim I,(v, w); 


¢ the second fundamental form of o is the first derivative of the second funda- 
mental form of 0¢: 
II,(v, 
I(v, w) = lim cA 
t>0 t 


¢ the shape operator B of o is the first derivative of the shape operator By of o;: 
Biv 
BOY Sie 
t>0 t 


° the extrinsic curvature K**' of o is the second derivative of the extrinsic cur- 
vature K&* = det By of oy: 


K&t(x) 
Kt = jj t : 
(*) 10 t2 

Proof. As in Section 4.2 (recall also the proof of Proposition 5.14), we can assume 
that the totally geodesic plane P is defined by x4 = 0 in R P?, and use the projective 
transformations 

0 0 

1 0 
c= 0 1 € PGL(4, R). 
0 


So oor 
~|— © Oo © 
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Hence, the limit of gf o; is 


(04(x))1 (0(x))1 
(04(x))2 (d0(x))2 
lim g7or(x) = lim (o7(x))3 | = | (Go(x))3 | > 
ote pay 
t 


t 


namely, o(x) = (00(x), u(x)) for some function uv which encodes the derivative of 
the vertical component of 0;. It is then clear, from the degenerate form of the metric 
of both *E3 and *Mim?, that in both cases the first fundamental form of the rescaled 
limit o is the limit of the first fundamental form of o;. 

For the second point, we will focus on the case of Ell? (with limit in IE) for 
definiteness. Since the statement is local, we can consider the computation in the dual 
covers $7 and co€uc*. Consider the unit normal vector fields N; to o;(S), chosen 
so that at time f = 0 the vector field is (0,0, 0, 1) and N; varies continuously with f. 
Recall that the second fundamental form of o;(S) satisfies 


II, (v, w) 


3 a A 
Ve (orb) — Vii () = I (v, w)N; = 


t- 


Now applying the transformation g7, we have 


co &uc 
ov 


ae 4 vs", (o;W) converges to V “ow by Proposition 5.14; 


° gt Vi! (w) converges to a tangential component to o(S), since Vy! (w) is tangent 
to o;(S); 


* tg*(N;) converges to T = (0,0,0, 1) since g*(N;) = (N}, N?, N}?, N#/t) and 
No = (0,0, 0, 1). 


This shows at once that II,(v, w)/t converges to the second fundamental form of o 
in co€uc?, and g* V1! (2) converges to Vi (w). 

Since II;(v, w) = I;(B;(v), w) and H(v, w) = I(B(v), w), the third point fol- 
lows from the first two statements. The last point is a consequence of the third point 
and the fact that Ke*' = det B; and K**' = det B. 

Clearly the proof is completely analogous for the convergence from d$? to *E3, 
or from the convergence from H? or Ad? to *Min?. 
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6.5 Projective nature of infinitesimal rigidity 


Let S be a smooth space-like surface in E3, Min?, Ell?, H3, d$3, or Ad$3. A vec- 
tor field Z on S is called an infinitesimal isometric deformation if it preserves the 
induced metric on S at the first order: £7 g(X, Y) = 0 for all X, Y € TS, which is 
easily shown to be equivalent to g(Vy Z, X) = 0 for all X € TS, if g is the metric 
of the ambient space and V its Levi—Civita connection. An infinitesimal isometric 
deformation is said to be trivial if it is the restriction to S of a Killing field of the 
ambient space. 


Now let us suppose that S embeds in an affine chart, and that this affine repre- 
sentation is also an affine chart for another model space, in which the image of S is 
space-like. By a straightforward adaptation of the proof of Lemma 5.18, it is easy to 
see that the infinitesimal Pogorelov map sends infinitesimal isometric deformations 
of S for the first ambient metric to infinitesimal isometric deformations of S for the 
second ambient metric. Moreover, Lemma 5.18 also says that trivial deformations are 
sent to trivial deformations. A surface S is infinitesimally rigid if all its infinitesimal 
isometric deformations are trivial. 


It follows from the above discussion that the infinitesimal rigidity of a surface in a 
three dimensional affine space is independent of the choice of the projective distance 
on the ambient space. 


Comments and references 


¢ A classical problem in Riemannian geometry is the existence of zero mean 
curvature surfaces with prescribed boundary at infinity. For instance, in hyper- 
bolic space the so-called asymptotic Plateau problem was proved by Anderson 
in [5]. In anti-de Sitter space, the analogous problem was tackled by Bonsante 
and Schlenker in [16]. With the tools introduced in this paper, the analogous 
problem in *Mimn? turns out to be very simple. Indeed, taking the trace in the 
expression B = Hess!” y—wId for the shape operator of a space-like surface, 
one obtains that a surface has zero mean curvature if and only if Au — 2u = 0 
where A denotes the hyperbolic Laplacian. Hence a proof follows from the 
existence and uniqueness of solutions to this linear PDE. 


Volkov [68] showed the following: a vector field Z on S is an infinitesimal 
isometric deformation if and only if for each x € S there exists a Killing field 
K* such that Z, = (K*), and (Vy Z), = (Vy K*), for all X € 7,8. 
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To simplify the presentation, we have considered different metrics on the same 
(affine) space. But we could have considered maps between different spaces, 
sending geodesics onto geodesics. In particular, it would follow from the re- 
sult presented here that infinitesimal rigidity in the Euclidean space is invariant 
under projective transformations. This fact is known since the end of the 19" 
century and called the Darboux—Sauer theorem, see e.g. [31, Chapitre IV]. This 
is also true for more general surfaces than smooth ones. For example consider 
polyhedral surfaces made of pieces of space-like planes. An infinitesimal iso- 
metric deformation is then defined as the data of a Killing field on each face, 
such that they coincide on common edges. Then Lemma 5.18 implies that the 
infinitesimal Pogorelov map sends deformations to deformations and trivial de- 
formations to trivial deformations. This invariance of infinitesimal rigidity is 
always true if one considers the more general case of frameworks, that includes 
polyhedral surfaces [39, 38]. 


The Pogorelov map is a map taking two convex surfaces S;, Sz in EII? and giv- 
ing two convex surfaces S1, S> of E>. If S; is isometric to S> (for the distance 
induced by the ambient metric) then S, and S> are also isometric. Moreover, 
if the isometry between S; and Sz is the restriction to S, of an isometry of the 
ambient space, then the isometry between S, and S> is also the restriction of 
an isometry of the ambient space. The difficult part is to prove that S; and S> 
are convex [51]. The differential of this map at S = S$; = Sp» gives the in- 
finitesimal Pogorelov map. The map was extended to H? [51] and to dS? [47] 
instead of Ell, see also [56, 57]. 
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1 Introduction 


The aim of this chapter is to present the trigonometry of complex projective spaces. 
Section 2 is devoted to a congruence theorem for triangles in C P? obtained by Brehm. 
We prove it here by another method which we presented in [3] and in [7]. It is shown 
that a triangle ([x], [y], [z]) in CP? with ||x|| = ||y|] = ||z|| = 1 is uniquely deter- 
mined up to isometry by its side lengths a = arccos(|{y,Z)|), b = arccos(|({x, Z)|) 
and c = arccos(|(x, y)|) and its shape invariant, the real part of the complex number 


(x, y) (y, z) (z,x) = cos(a) cos(b) cos(c)e!”. 


Section 3 consists of Hermitian trigonometry. as it is developed in [4]. Precisely, we 
give the two laws of sines and the law of cosines of a triangle in C P?. In Section 4, we 
define the polar triangle associated to a given triangle and derive from it the second 
law of sines which does not appear in [2]. On the other hand, if we consider a triangle 
([x], [Ly], [z]) and its polar triangle ([X], [Y], [Z]), and if we set 


(X,Y) (Y, Z)(Z, X) = cos(p) cos(q) cos(r)e!®, 
we show that w and Q are linked. Indeed, consider the complex number 


(uz, U1) (V1, V2) (Wy, W2) = |(U2,U1) (v1, V2) (Wr, wW2)| et, 
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where U1, U2; U1, V2; W1, W2 are pairs of unit tangent vectors at the vertex [x], [y], 
[z] respectively. It is displayed that 


Q=-a-20 +(2k+1)n, keEZ. 


This result is new in comparison with the results in the papers quoted. In addition, we 
relate Brehm’s invariants to those of Hsiang [10] and those of Shirokov and Rozen- 
feld [14]. We close this paper by some remarks on trigonometry in general symmetric 
spaces. 


2 Congruence theorem for triangles in C P? 


In classical geometry, Euclidean, spherical, and hyperbolic spaces are commonly 
characterized by the same set of congruence axioms for geodesic intervals and tri- 
angles. In the language of Riemannian geometry, these axioms are stated as fol- 
lows ({10]): 


1. two geodesic intervals A B and A’ B’ are congruent if they have the same length; 


2. two geodesic triangles are congruent if they have two equal angles contained 
by a pair of equal sides. 


Let ABC be a geodesic triangle in a symmetric space M of rank 1 and A(M) 
the moduli space of congruence classes of triangles in M. In the well-known clas- 
sical spaces, a triangle has six basic invariants (side lengths and angles) which map 
A(M) into R®°. The image set is of dimension 3 since the 6 invariants satisfy three 
basic functional relations, the laws of sines (which involve 2 equalities) and cosines 
(one equality). Riemannian symmetric spaces of rank | are precisely the two-point 
homogeneous spaces [18]. 

In 1994, in his Ph.D. thesis [7], the author gave new congruence criteria for fi- 
nite subsets in both complex projective and complex hyperbolic spaces. The case of 
triangles is then easily derived. Using this point of view, he derived an easy congru- 
ence theorem for triangles. This leads directly to the laws of trigonometry in CP? 
obtained by Brehm, from which one can easily derive those of the sphere. 

We consider points in CP”, n > 2 as one-dimensional subspaces of the vector 
space C”t!, For x € C”*!, x # 0, let [x] := C+ x denote the corresponding point 
in CP”. Let (x,y) = ea : x;y; be the usual Hermitian product and let d be the 
Fubini—Study metric on C P” defined by 


d([x], [y]) = arecos Css a): 
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Itis well known that ®: CP” — CP” is an isometry of C P” if and only if there exists 
a unitary matrix U such that either ®([x]) = [Ux] for all x ¢ C”*! or &([x]) = [Ux] 
for all x € C?*?, 


Let [x], [y], [2] ¢ CP” with |x|] = |lyl] = [lz] = 1 
Definition 2.1. A triangle is a triple ([x], [y], [z]) of three different points in C P” 
such that the distance between any two of them is strictly smaller than 4. 


We imagine the vertices [x], [y], [z] of a triangle being connected by the unique 
shortest paths between them, called sides. We set 


a = arccos(|{y, Zz) 


), b =arccos(|{x,z)|), ¢ = arccos(|(x, y)|). 


Consider the Hermitian matrix 


I 4%, 9) - (%y2) 
H = | (y, x) I, hae) 
(z,x) (z,y) 1 


which is positive semidefinite with 1’s on the diagonal. This matrix can be brought, 
by multiplication of columns by unimodular complex numbers and the corresponding 
rows by the conjugate of those complex numbers into the following form 


1 cos(c) cos(b) 
cos(c) 1 cos(a)e!® |. (1) 
cos(b) cos(a)e~!? 1 


The complex number e! is called the direct shape invariant in [3]. It was consid- 
ered by Blaschke and Terheggen. Terheggen interpreted it in spherical geometry [15]. 
The real number cos is called by Brehm the shape invariant. It was shown by 
Brehm [4] that a triangle in CP” is determined, up to isometry, by its side lengths 
and its shape invariant. Brehm used the invariant 0 = cos(a) cos(b) cos(c) cos(@). 
The author showed in his Ph.D. thesis that two triangles in CP” are directly con- 
gruent (if we do not allow conjugation) if and only if they have equivalent associated 
matrices. We already saw that the form (1) is determined up to equivalence by the real 
numbers a, b, c, and cos(@) (conjugation is allowed). Now let H be any Hermitian 
positive semidefinite 3-matrix with 1’s on the diagonal. Clearly it can be decomposed 
as the product T*7T where T is an upper triangular matrix (n + 1,3), > 2 with 
diagonal positive real numbers and 7% is the transpose of its conjugate. The columns 
of T are the representing vectors of the desired triangle. 

More general results on finite subsets in complex projective spaces can be found 


in [3]. 
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A straightforward calculation gives the following vectors which are the columns 
of the upper triangular matrix T, 


x = (1,0,0,...,0), 
y = (cos(c), sin(c),0,...,0), 
zZ = (cos(b), Z2 + 25, 23,0,...,0), 


where 
cos(a) cos(@) — cos(c) cos(b) 
22 = _ss—F (2) 
sin(c) 
, _ cos(a) sin(@) 
72 = sin(c) G3) 


ile cos?(a) — cos?(b) — cos?(c) + 2 cos(a) cos(b) cos(c) cos(w))!/? 


Zz 
sin(c) 


(4) 
The triple ([x], [y], [Z]) is called in [4] the normal form of the congruence class of 
the triangle. We have then proved the following theorem due to Brehm. The proof 
presented here is slightly different from Brehm’s one and was obtained in the author’s 
Ph.D. thesis [7] and published in [3]. 


Theorem 2.2. 1. A triangle in C P” is determined uniquely up to isometry by its side 
lengths and its shape invariant. 

2. LetO0 < a,b,c < . ando € R. Then there exists a triangle in CP” with side 
lengths a,b,c and shape invariant o if and only if 


5 (c032(a) + cos?(b) + cos?(c) — 1) < o < |o| < cos(a) cos(b) cos(c). 


3 Complex projective trigonometry 


Let x € C”*!, ||x|| = 1. The tangent space of CP” at [x] can be canonically 
identified with (C-x)+ in C"*!. The real part t((, )) of the Hermitian product is the 
Riemannian metric of CP”. We represent as in [4] the tangent unit vector u(x, [y]) 
at [x] of the shortest geodesic from [x] to [y] as 


y (yx) — x |(y, x)? 


u(x, [y]) = ly (yx) —x I(y, x) |? | 
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Consider the normal form given above and let us set uy = u(x,[y]) and uz = 
u(x, [z]). By a straightforward calculation one has 


ui = (0,1,0,...,0), v2 = (0,22 + z4i,z3,0,..., 0) sin '(B). 
The angular invariants of [4] are defined by 
A(A) := arccos(#t((u1, u2))), 
®(A) := arccos(|{u1, u2)|), 
W(A) := aresin(|S((w1, u2))|), 


where A, B, C correspond to the corners [x], [y], [z] respectively. Geometrically, 
A(A) is the angle between the geodesics, ®(A) is the angle between the projective 
lines generated by wu; and wz and (A) is the smallest angle between the geodesic 
from [x] to [z] and a totally geodesic RP? containing [x] and [y]. However, the 
three angles are dependent, they satisfy sin? A(A) = sin?(®(A)) + sin?(W(A)). 
Clearly the pair of angular invariants of one corner together with the lengths of its 
adjacent sides constitute a minimal set of congruence conditions. Thus, we have 
dim A(C P”) = 4. Therefore there exist five basic functional relations among the 
nine invariants of a triangle. 


A short computation yields 


22 
cos(A(A)) = sin(by’ (5) 

. a) 
sin(W(A)) = an) (6) 

: 23 
sin ®(A) = sin(by’ (7) 

Theorem 3.1. For any triangle ABC in CP”, 

sin(®(A)) _ sin(®(B)) _ sin(®(C)) 8 
sin(a) —sin(b)~——ssin(c) (8) 
sin(¥(A)) — sin(W(B)) _ sin(¥(C)) 9 
sin(2a) —sin(2b) ~——ssin(2c) ” ©) 
cos (a) — (cos(b) cos(c) + sin(b) sin(c) cos(A(A)))” a0 
) 


= sin? (W(A)) sin?(b) sin? (c). 
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Proof. Equation (8) is obtained using (4) and (7). Equation (9) is obtained from (3) 
and (6). Now for the law of cosines we have from (2) and (5), 


cos(a) cos(b) cos(c) cos(@) 


11 
= cos(b) cos(c) sin(b) sin(c) cos(A(A)) + cos? (b) cos?(c), we 
and from (4) and (7), 
2 cos(a) cos(b) cos(c) cos(@) 
(12) 


= sin?(®(A)) sin?(c) sin?(b) — 1 + cos?(a) + cos*(b) + cos?(c). 


Combining (11), (12), and the relation between A, ® and W we arrive easily to the 
law of cosines (10). 


The law of cosines (10) does not appear in this form in Hsiang’s and Brehm’s arti- 
cles. From (10) we find again the well-known law of cosines in elliptic and spherical 
spaces. 

Brehm’s law of cosines reads as follows: 


cos(2b) cos(2c) — cos(2a) + cos(A(A)) sin(2b) sin(2c) 


(13) 
= 2sin?(®(A)) sin?(b) sin? (c). 
Remark 3.2. 1. If ®(A) = 0 then (13) is exactly the classical cosine law for spherical 


triangles, noting that C P! is isometric to the sphere of radius s. 


2. If (A) = 0 then (10) is exactly the classical cosine law for elliptic and thus 
spherical triangles. 


3. The congruence theorem for geodesic triangles was stated for the first time by 
Blaschke and Terheggen [2]. They showed that the congruence class of a geodesic 
triangle in a complex projective plane is determined by its side lengths and a fourth 
invariant cos ¢, satisfiying 


cos(®(A)) sin(@) = sin(W(A)). 


Later, Terheggen interpreted in [15] the invariant @ as an angle in spherical geometry. 
In addition, he showed that all spherical trigonometry in the sense of Study [16] is 
contained in the geometry of the complex projective line. Although Blaschke and 
Terheggen were unaware that we need five relations, they obtained three. By means 
of the notion of polar triangle in a complex projective plane, they obtained a sine and 
a cosine law. However, it is seen below that the second sine law can be easily de- 
rived from their investigation. Their sine law leads easily to the well-know spherical 
sine law obtained by Coolidge [5]. Another approach was put forward by Shirokov, 
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in a paper published posthumously by Rozenfeld [14]. Shirokov took two angular 
invariants at each vertex: the Riemannian angle A(A) between the two sides and the 
holomorphy inclination y(A). It is seen that 


sin(A(A)) cos(y(A)) = sin(W(A)). 


4 Blaschke and Terheggen laws 


In C3, for any vectors x and y, Blaschke and Terheggen defined the product x x y 
which is a vector in C? with coordinates 


XsVr—X2Ys, (st) € {(1,2), (2,0), 0, 1)} 
and which satisfies 

1. (x x y,x) = 0, (x x y, y) =0, 

2. (x x yew xv) = (xu) (y.v) — (x0) (yu), 

3. (x x y) x UX v) = (xuv)y — (yur)x. 


Item 2 is Bianchi’s identity and (xv) is the determinant of the three vectors. Now let 
([x], Ly], [z]) be a triangle in C P? with ||x|| = || y|| = ||z|]| = 1,@ = arccos(|(y, z)|), 
b = arccos(|(x, z)|), and c = arccos(|(x, y)|). 


Definition 4.1. The triple ([X], [Y], [Z]) with 


yrzZ ZxXxXx xx y 


~~ sin(a)’ ~ sin(b)’ ~ sin(c) 


is called the polar triangle of the triangle ([x], [y], [z]). 


Now set |(X, Y)| = cosr, |(Y, Z)| = cos p and |(Z, X)| = cosq. Let us also set 
vy = u(y, [x]), v2 = u(y, [Z]), wi = uz, [y]) and w2 = u(z, [x]). Now consider 
the complex numbers w, a and 2: 


(x, y) {y, Z) (z, x) = cos(a) cos(b) cos(c)e!”, 
(uz, U1) (v1, V2) (Wi, W2) = |(u2, U1) (V1, V2) (wy, W2)| er, 
(X,Y) (Y, Z) (Z, X) = cos(p) cos(q) cos(r)e!®, 


The invariants w, Q, and a are related as follows. 
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Theorem 4.2. For any triangle ABC in CP?, 
1. ®(A) = p, P(B) = 4, O(C) = 1; 
2. Q=-a—-—2@+4+ (2k + 1)7,k € Z. 
Proof. A direct computation leads to the following formulae: 


cos(a)e!® — cos(b) cos(c) 


Witenes 
and 
cos(aje!® (u1, uz) 
Y,Z)=- 
( ) (z, y) 


By acyclic permutation of a, b, c and x, y, z and X, Y, Z we obtain 


_cos(b)e"@ (v2, 01) 


(Z,X)= ez 


’ 


(X,Y) = __cos(c)e*” (w2, w1) 


(y,x) 


(14) 


(15) 


(16) 


(17) 


Clearly from (15), (16), and (17) we arrive to cos ®(A) = cos p, cos ®(B) = cosq, 


cos ®(C) = cosr and 


(X,Y) (¥, Z) (Z, X) = —e77! (uy, ur) (v2, v1) (wa, wi), 


from which assertion 2 follows. 


All the trigonometric laws found by Blaschke and Terheggen, Rozenfeld, Shi- 
rokov, Hsiang and Brehm are the same. Of course, the second sine law was not 
found by Blashke and Terheggen although they knew that a triangle in C P? depends 
on four invariants. The second sine law is self-contained in their investigation and it 
is derived as follows. With our notation the angles ¢, x and y of [2] are given by 


. _ __cos(a) sina) —__ sin(¥(A)) 
an) cos(p)sin(b) sin(c) —cos(p) ’ 
: _ __cos(b)sin(w) _ sin(W(B)) 

mn) = cos(q) sin(c)sin(a) —_—cos(q) 
dt = cos(c) sin(@) _ sin(¥(C)) 


cos(r) sin(b) sin(a) 7 cos(r) 


(18) 


(19) 


(20) 
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It follows from the equalitie of the left hand sides of (18), (19), and (20) that 


cos(p) sin(¢) = cos(q) sin(y) Z cos(r) sin(y) ; (21) 
sin(2a) sin(2b) sin(2c) 
Formula (21) is exactly the second sine law. 
In [2] or from (14) we find the formula 
cos(b) cos(c)e?” — cos(a) 
sin(b) sin(c) 
By a direct computation, using p = ®(A), (2) and (5) we arrive easily to the cosine 
law (10). 
In [2] we find 


cos p = 


sin(p) _ sin(q) _ sin(r) 

sin(a) sin(b) __ sin(c)’ 
Clearly this is the sine law (8) since p = ®(A), q = ®(B) andr = ®(C). We can 
then consider that all the laws were found by Blaschke and Terheggen. Item 1 of the 
above theorem is known in [15] in case of triangles degenerating in real projective 
planes. On the other hand, Terheggen showed that the work of Study on spherical 
trigonometry [17] is contained in the trigonometry of the complex projective line [15]. 


Remark 4.3. Let us turn to Hsiang’s laws. Brehm’s angular invariants A(A), ®(A), 
(A) correspond to the angular invariants A(A), 5 — (A), 5 — 0(A), respectively, 
which have been defined by Hsiang in [10]. Of course Hsiang derived the same 
laws of sines but instead of (10) or (13) he obtained a more complicated equivalent 
equation which we do not give here. 

Shirokov took two angular invariants at each vertex: the Riemannian angle A(A) 
between the two sides and the holomorphy inclination y(A). By comparing his three 
laws, we easily see that the holomorphy inclination is linked to Brehm’s angles as 
follows: 

sin(A(A)) cos(y(A)) = sin(W(A)). 
The trigonometric laws found by Rozenfeld and Shirokov are the same as (8), (9), 
(10), or (13) where we replace sin(W(A)) by sin(A(A)) cos(y(A)) and sin(®(A)) by 
sin(A(A)) sin(y(A)) and by a cyclic permutation of the letters A, B, C. This can be 
found in [14], [16], and [13]. 


5 Concluding remarks 


1. A slight modification of the definitions and computation yields similar results for 
triangles in complex hyperbolic spaces. The results can be found in [4]. 
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2. It follows from the normal form that each triangle in KP” (n > 2, K = C, H) 
is contained in some totally geodesic C P?. 


3. It is known from Wolf [19] that each isometry of a totally geodesic CP? in 
Cay(P7) can be extended to the Cayley plane. Thus, it is sufficient to regard triangles 
of the Cayley plane only in C P?. The results are also valid for triangles in Cay(H7). 


4. According to Wang [18], KP”, KH”, K = R, C, H, Cay(P?), Cay(H”) are 
besides the classical spaces all the two-point homogeneous spaces. 


5. Let ([x], [y], [z]) be a triangle in C P? and let S be the surface generated by the 
geodesic arcs issued from [x] with endpoints on the geodesic arc between [y] and [z]. 
Let J be the integral of the Kahler form on the surface S. Hangan and Masala showed 
that @ = —2/ + 2kx, for some integer k, see [8]. 


6. The result of [8] made more clear a result obtained in [6]. the author constructed 
regular tetraedra of CP? which have symmetry groups isomorphic to the symmetric 
group S4. It turns out that a regular tetrahedron with edge a < arccos B is nec- 
essarily contained in a real projective sub-space. Indeed, by regularity and Stokes’ 
theorem, the integral J performed on its faces vanishes and we get from (3) and (6) 
that sin ¥(A) = 0. However, in [6], he proved that for any a such that a > arccos wee 
there exists a regular tetrahedron with edge a and with zero shape invariant. In this 
case, 27 = 4 + kz. This means that the quadruple with the geodesic arcs between 
the vertices and the faces generated by the geodesic arcs issued from a vertex with 
endpoints on the opposite edge, constitute a simplicial complex of dimension 2 whose 
homology class of dimension 2 generates the homology group H2(CP”, Z) which 


is isomorphic to Z [9]. 


Trigonometry in general symmetric spaces is still at its infancy. 


1. The trigonometry of symmetric spaces of rank 1 was studied by Hsiang in 
1989 using integration of the geodesic flow [10]. A different approach using models 
in projective spaces has been given by Brehm in 1990 [4]. The trigonometry of a 
complex projective plane was first studied by Blaschke and Terheggen in 1939 [2], 
Terheggen in 1940 [15] and later pursued in 1957 by Rozenfeld [14] and indepen- 
dently by Sirokov in 1957 [16]. 


2. Aslaksen in 1989 [1] developed trigonometry for triangles in the Lie group 
SU(3) using algebraic invariants. Leuzinger in his Ph.D. thesis in 1990 [11] devel- 
oped trigonometry for arbitrary Riemannian symmetric spaces of non-compact type 
using the general theory of symmetric spaces. 
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3. Hangan [9] wrote some trigonometric formulae for Grassmann manifolds. 


4. The congruence criteria for triangles in Grassmann manifolds was stated by 
Masala in his Ph.D. thesis in 1996 using matrices of projections [12]. 
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1 Introduction 


Inspired by Lexell’s theorem (see [9] and [10]) in spherical geometry, which states 
that for two fixed points A and C on a sphere, the locus of the third vertex B of a 
spherical triangle ABC of constant area is a small circle, N. ACampo and A. Pa- 
padopoulos [1, Theorem 5.11] have shown that, in plane hyperbolic geometry, the 
locus of the third vertex B, which lies on the same side of AC, is a hypercycle. 
Nevertheless, as shown by A. Papadopoulos and W. Su [15, Proposition 4.4], the 
midpoints of the segments BA and BC lie on a line as B moves along that hypercy- 
cle. Thus, although the locus of B bears no resemblance to its Euclidean counterpart, 
the locus of the midpoints of the segments BA and BC is identical to that in the 
Euclidean case: a line. This raises the question whether the result could be stated 
and proved inside absolute geometry. A first candidate would be Hilbert’s plane ab- 
solute geometry A, whose axioms are the plane axioms of groups I, II, and III of 
Hilbert’s Grundlagen der Geometrie (Foundations of Geometry) [6], or equivalently 
axioms Al—A9 in [16]. Its models will be referred to as Hilbert planes (see also [5]). 
The notion of area in Hilbert planes has been investigated in [4]. According to its 
results, two triangles ABC and AB’C, with B and B’ on the same side of AC, have 
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the same area in the case of the Euclidean metric (that is, if there is a rectangle in 
the Hilbert plane) if and only if the altitude from B to AC and the altitude from B’ 
to AC are congruent. In the case with non-Euclidean metric (that is, if there is no 
rectangle in the Hilbert plane) triangles ABC and AB’C, with B and B’ on the same 
side of AC, have the same area if and only if the sum of the angles is the same in 
both triangles. At first sight it seems that, for the non-Euclidean case, the axioms 
for Hilbert planes are needed to even state the result, for one needs order, that is the 
notion of betweenness, to state that the points B and B’ lie on the same side of AC, 
and one needs the ability to transport angles so that one can meaningfully refer to 
the sum of three angles that do not have the same vertex. So, it seems that order and 
free mobility (the fact that, given two distinct points A and B, there is a rigid motion 
(a one-to-one and onto mapping preserving congruence and betweenness) mapping 
A onto B, and given two distinct lines g and h, there is a rigid motion mapping g 
onto /) are required to even state the problem in its current form. 


While this is true, one can operate a small change in the statement of the problem 
and state it in a more general geometry than A. First, let us see how we can express 
area equality in the case of a non-Euclidean metric. Given triangle ABC, let U 
denote the midpoint of BC, let V denote the midpoint of AC, and let W denote 
the midpoint of AB. Let R denote the reflection of V in U, and let Q denote the 
reflection of V in W. Since VCU = RBU and VAW = OBW , the angle OBR 
represents the sum of the angles of triangle ABC. Since BR = BO = VC, and VC 
is constant, given that A and C are fixed points, the isosceles triangle BOR has its 
two congruent sides of fixed length, and the angle between two sides is congruent to 
the sum of the angles of triangle ABC. The sum of the angles of triangles ABC and 
AB'C are thus the same if and only if OR = Q’R’, where by Q’ and R’ we have 
denoted the points obtained in the manner Q and R were, only starting with triangle 
AB'C. It thus turns out that only the assumption that any segment has a midpoint is 
needed, not the entire assumption of free mobility. We also notice that order is not 
necessary, for there are two lines on which the midpoints of the sides BA and BC 
can lie, and one is the reflection of the other in the line AC. So we need a geometry 
in which we can express segment congruence, and in which any pair of points has a 
unique midpoint. 

The congruence core of plane absolute geometry was investigated in a series of 
papers by J. Hjelmslev [7] and [8], and reached a particularly simple form in the 
axiomatics presented by F. Bachmann [2] for structures called metric planes. We will 
see that our theorem can be expressed and proved in the theory of non-elliptic metric 
planes in which every segment has a midpoint. Why “non-elliptic’”? Metric planes 
are called elliptic if there are three line reflections in them whose composition is the 
identity. This amounts to the existence of points which have several perpendiculars 
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to a given line. Under those circumstances the uniqueness of midpoints is no longer 
available and the whole edifice of our reduction of the area problem to one about 
reflections crumbles. 


2 Metric planes 


Metric planes, as presented in [2], are defined in the language of groups. The idea 
behind the axiomatization is to focus on line reflections and on their properties when 
composed with other line reflections. One notices that if a and 5 are line reflections, 
and it so happens that ab = ba, then the lines defined by the reflections a and b must 
be perpendicular, and conversely. Since perpendicular lines intersect, and the compo- 
sition of the reflections in them amounts to the reflection in their point of intersection, 
we can choose to refer to the product ab, in case ab = ba, as to a point reflection. 
Another familiar geometric notion, that of point-line incidence can now be expressed 
in the language of line reflections. The point P determined by the point reflection 
ab, where a and b denote line reflections with ab = ba, is said to be incident with 
the line determined by the line reflection g if Pg = gP. 

The axiom system consists of a fundamental assumption, that G be a group (writ- 
ten multiplicatively) generated by an invariant set S of involutory elements, as well 
as five axioms (see below). The elements of S' will be denoted by lowercase Latin 
letters, and will be referred to as line reflections (or simply lines). Involutory products 
of two elements in S' will be denoted by uppercase Latin letters, and will be referred 
to as point reflections (or simply points). For any two elements aw and 6 in G, a | B 
denotes the fact that a - 6 is involutory. The notation a1,...@, | 61,...Bm stands 
for the conjunction of all a; | 6; withi € {1,...,n} and j € {1,...,m}. The rela- 
tion P | g may be read as “point P is incident with line g,” and g | / as “line g is 
orthogonal to line h.” The axioms for metric planes state that 


Al. For all points P and QO there is a line g such that P,Q | g. 


A2. If, for points P and QO and lines g and h, we have P, O | g,h, then P = Q or 
gah. 


A3. If the lines a,b, and c and the point P are such that a,b,c | P, then there is a 
line d such that abc = d. 


Ad. If the lines a, b,c, and g are such that a, b,c | g, then there is a line d such that 
abc = d. 


A5. There are lines g,h, j such that g | h, but none of j | g, j | h, or j | gh holds. 
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By Al and A2 there is a unique line joining two distinct points P and Q. We will 
denote it by (P, Q). It is, in general, not true that a midpoint exists for every pair of 
points. In this context, we say that M is a midpoint for A and B if AM“ = B. Here 
we have denoted by a the element B~!af. Nor is it in general true that a midpoint, 
should it exist, is unique. In fact, to ensure uniqueness (see [2, pp. 53-54]), it is 
necessary and sufficient to assume that the metric plane is not elliptic. This means 
that it satisfies the axiom 


~P. For all a,b,c, we have abc # 1. 


An alternate requirement which is equivalent to ~ P, is that, for any line / and 
any point P not incident with /, there exists a unique line g incident with P and 
perpendicular to /. This requirement is obviously false in the case of plane elliptic 
geometry, in which, for every line there is a point from which one can draw infinitely 
many perpendiculars to that line (that that there are infinitely many perpendiculars 
from a pole of a line to that line follows from [2, Satz 20, §6,12, p. 121], which states 
that there are no finite elliptic planes, so every line must be incident with infinitely 
many points). Double elliptic geometry is already excluded by the axioms for metric 
planes, as there is a unique line joining two distinct points in metric planes. 

Since the theorem we want to prove depends both for the definition of the area and 
for its conclusion on the uniqueness of midpoints, we will be interested only in non- 
elliptic metric planes. Since we also need the unrestricted existence of midpoints, we 
also assume the axiom 


M. For all A, B, there exists M such that AM = B. 


Metric planes can have a Euclidean metric, which means that there exists a rect- 
angle, 1.e. 


R. There are a,b,c, d such thata,b|c,d anda #b,c #d. 
or a non-Euclidean metric, which means that there is no rectangle, i.e. 
~R. Ifa,b|c,d,thena = borc =d. 


For metric planes satisfying R, it can be easily seen that the midlines (U.W) and 
(U’, W’) of two triangles ABC and AB’C having the same area—where U and W 
are the midpoints of (B,C) and (B, A), and U’ and W’ are the midpoints of (B’, C) 
and (B’, A)—are either identical or one is the reflection of the other in (A, C), given 
that, in the presence of R, the area equality of ABC and AB’C is equivalent to the 
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statement that B = B’ or (B, B’) || (A,C), or the midpoint of BB’ lies on the 
line (A, C) (here || stands for the notion of Euclidean parallelism, with g || 4 being 
defined to stand for “for all /, 7 | g if and only if / | h”). 

The interesting case is thus that in which ~R holds. 

Non-elliptic metric planes have been axiomatized in various other languages 
in [11], [17], [12], [13], and [14]. The Hilbert planes are precisely the ordered met- 
ric planes with free mobility. Although there are Hilbert planes in which the sum of 
the angles of a triangle exceeds two right angles, as was discovered by Max Dehn 
in [3], these are non-elliptic in the sense that the perpendicular from a given point 
to a given line is always unique. The phrasing and proof of our theorem inside non- 
elliptic metric planes is thus more general than any version thereof inside Hilbert’s 
plane absolute geometry. 


3 Statement and proof of the theorem in planes 
with non-Euclidean metric 


The theorem we will now prove inside the theory axiomatized by the fundamental 
assumption, Al—A4, M, ~P, and ~R can be stated as follows (we do not need AS, 
for, in the absence of the elements it stipulates to exist, the hypothesis of our theorem, 
that there are three non-collinear points A, B, C, is false, and thus the theorem is 
true): 


Theorem 3.1. Jf A 4 C, A,C |b, B+b,C”Y = B, BY = A, AY =C,U,W 5, 
QO=V",R=V", Bt b,C™ = BY BY = A,U',W'|s', O' =V", 
R=V"0 0" (eR | fe f.and0"? ¢ = 0" hens =! or =". 


Proof. Before starting the proof, we should explain why the statement above reflects 
the intentions of the theorem we want to prove. As mentioned in the introduction, we 
have reduced the original problem to showing that, if R is the reflection of V in U, 
QO is the reflection of V in W, and similarly for R’ and Q’, and (R, Q) = (R’, Q’), 
then the lines s = (U, W) and s’ = (U’, W’) either coincide or one is the reflection 
of the other in the line (A, C). To make sense of the segment congruence relation, 
we observe that we can define it in metric planes by stipulating that (A, B) = (C, D) 
if and only if there exists an element a € G such that C = A® and D = B®. 

We now turn to the reason why the conditions 0’, OU’ | e, R’'| f,e| f, and 
QUU'f — Q! amount to saying that (R,Q) = (R’,Q’). Since RY = V and 
VY’ = R’, we have that RUU’ = R’. By our definition of segment congruence we 
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thus conclude that (R, Q) = (R’, QU’). However, we also have (R’, QUU’) = 
(R’, QO’), for if QYY’ = Q’, there is nothing to prove (and f will have to be 
chosen to be (R’, QO’) and e the perpendicular raised in Q’ on (R’, Q’)), and if 
OU” Z Q’, then f is the unique perpendicular to (QY"’, Q’) passing through R’, 
and (R’, OU") = (R'S, OY"’S), which, given that R’4 = R’ and QUU’S = Q’, 
amounts to (R’, QUY’) = (R’, O’). 

We are now ready for the proof. Let v be such that V | v and v | s, and let v’ 
be such that B | v’ and v’ | s (these perpendiculars dropped from a given point to a 
given line exist by [2, Satz 2 of §3,4, p.37]). According to [2, Satz 2 of §4,1, p. 57], 
we have Uv'W = vandC” = A. Let M be the midpoint of vs and v's, thus a point 
for which v“ = v’. We now have 


@)" = vl, (1) 


(v¥)! = yl. (2) 


Figure 1. The triangle ABC with all relevant points and lines 


To prove (1), first notice that v@M = MvM andv” = v'! = UUW, s0 MvM = 
UvW = WU (given that UvW is a line, thus involutory), and thus (*) MuMU = 
Wv. Using (*), we get (vY)”@ = vW™@ = M(Wv)WM = M(MvMU)WM = 
vMUWM. On the other hand, v’'Y = Uv'U = UUvWU = vWU. To prove (1), 
we thus need to show that vMUWM = vWU, ie., that MUWM = WU, which 
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amounts to (MUW)? = 1. This holds, given that the points M, U, and W are in- 
cident with the line s, and the compositions of reflections in three collinear points 
is a reflection in a point, thus involutory (see [2, Satz 24b of §3,9, p. 49]). To 
prove (2), first notice that vV¥ = (WvU)” = WWvUW = vUW. By (*), we 
have Uv = MvMW,s0 vU4 = M(Uv)UM = M(MvMW)UM = vMWUM. 
To prove (2), we thus need to show that vMWUM = vUW,ie., that MWUM = 
UW, which amounts to (MWU \e = 1, which holds for the reason already men- 
tioned in the proof of (1). 


It is easily checked that if a | B, then aw? | B°. Given that V | v, we have 
VW | vo, andso (VY)™ | (UY)™, ie., by (1), (VY)™ | v’Y. Also by V | v, we 
have VY | vY, andso (VY)™ | (vY)™, ie., by (2), (VU)™ | v'™. 


Note that, again by [2, Satz 2 of §4,1, p. 57], v’ is the perpendicular bisector of 
segment (Q, R), and thus, given that v’” = v and that orthogonality and midpoints 
are preserved under rigid motions, we have that v is the perpendicular bisector of 
(Q1, Ri), where by Q; and R, we have denoted o* and R™. 


Let P = sv, D = v's, let g denote the line (1, R1) joining Q, and R,, and 
T = vg. We will now focus on the following configuration of points and lines: V, v, 
s,b, P,C,u'", D Ri, g, T. The elements of our configuration are in the following 
relations: V | v,b; D | svV-C | b,u'”; P |s,vu; Ril g, ww T |v,g;v|b,s, g; 
v’W | s; (V, P) = (R1, D). The latter holds since VU“ = R, and PUM = D. We 
also have that (R1, 7) is half of (Q1, Ri). Following the same steps, with B replaced 
by B’, U by U’, and W by W’, we construct the points Rj, and T’. 


Since we could not see how to prove this theorem by purely reflection-geometric 
considerations, we will use the fact that all non-elliptic metric planes with a non- 
Euclidean metric can be embedded in a projective-metric plane over some field of 
characteristic ~ 2, containing its origin, such that the image of the point set of the 
non-elliptic metric plane does not intersect the line at infinity (the polar of the origin). 
This major result and the formulas used below are the subject of [2, 86, pp. 93-127, 
§8,1, pp. 141-144]. In the embedded non-elliptic metric planes, the points can be 
written as pairs (a, b), witha, b € F, and the lines can be written in homogeneous co- 
ordinates as triples [u, v, w], not all of u, v, w being allowed to be 0, two such triples 
representing the same line if they can be obtained from each other by multiplying 
each coordinate with the same non-zero element of F. The incidence of (a, b) with 
[u, v, w] (to be denoted by (a, d) | [u, v, w]) is given by au + bv + w = 0, whereas 
the orthogonality of [u, v, w] and [u’, v’, w’] (to be denoted by [u, v, w] | [u’, v’, w’]) 
is given by Auu’ + wvv'+vww’ = 0, where A, ju, v are fixed elements of F, all dif- 
ferent from 0. The congruence of two segments ((a, b), (c, d)) and ((a’, b’), (c’, d’)) 
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is given by 

F((a,b),(c,d))?_ F(a’, b’), (c, d’))? 

O((a,b))O((e.d)) O(a’, b’)) O(c, d’))’ 
where 

F(x, y), @’, y’)) = poxx! + Avyy’ + Ap 
and 


O((x, y)) = F(x, y), , y)). 


We can embed the metric plane in such a way that V is mapped onto (0,0), b is 
mapped onto [0, 1,0] and v onto [1,0,0]. Then, for some a,x € F,C = (a,0), 


P = (0,x), s = [0,1,—x]. Since v'™ is incident with (a,0) and orthogonal to 
[0, 1, —x], if we denote v/Y by [m,n, p], we have am + p = Oand wn —vxp = 0. 
Thus v/Y” = [u,—vxa,—ap]. Since it lies on s, D = (5, x), for some 5 € F. 


Since D is on v/¥ 


ie., D = (a(1 + ux), x) Regarding the coordinates (a, 6) of Ry, we have the 
following two relations that Ry is in 


, we have (6,x) | [“,—vxa,—ay], and so 6 = a(1 + aes 


(i) (a, B) | [, —vxa, —ap] 


and 
v 
ii) ((0,0), (0, x)) = ((a(1 + —x7). x), (@, )). 
Gi) (0.0), 0.) = ((a(1 + 52”). x). @8)) 
From (i) we get thata = a( 7 Bx + 1). Since 


F((0, 0), (0, x))? Le 


Q((0,0))Q((0,x)) vx? + 
and 
F((a(1 + £x?), x), (a(2Bx + 1), B))? 
O(a + 2x2), x) OUaZBx + DB) 
w( 7 Bx + 1)*(va2(1 + ax") + A) 
(1+ ax?) (uva? (+ Bx + 1)? + AvB2 + Ap) 


(ii) gives 


peva?( Bx + 1)" + Avp? +Ap = (Bx + 1) (va2(1 + av) + A). (3) 
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Our hypothesis states, as noted at the start of the proof, that (0, R) = (Q’, R’). 
Since (R,, 7) is half of (QO, R), and (R‘, T’) is half of (Q’, R’), our hypothesis 
implies that (Ri, 7) = (Rj, T’). With R), = (a’, B’), we thus have 


F((O, B),(@,B))?_ _ _F((, B), (a, B’))* 


oO : 4 
Q((0, B))O(a@.B))  O((0, BY) O(a’, B)) - 
Applying (3) to 
F((0,B).(@.B)?__ Avp2 + Au 
Q((0, B))O((a, B)) pva? (+ Bx + iy + AvB2 + Ap 
we get 
F((0,B), (a, 6)? _ Avp? + Ay S 
Q((0, B))O((a, B)) (7 Bx + 1)" (va2(1 + ae) + Ay 
Applying (3) again, we get 
AvB? + Ap = (—px + 1) (vax? +A) 
be 
and thus 
Avp? + Ap = v7a7x? + pA (6) 


(7 Bx + 1)*(va2(1 + ae) + 2) p(va2(1 + a) +A)" 


Since the value of a is fixed, as it is the non-zero coordinate of the fixed point C, 
equations (4)-(6) give 
v7q?x? + pa v7a?x? + pr 


ee q 
p(va2(1 + 7x?) +A) p(va2(1 + zx?) +i) D 


2 — 


which implies x? = x’”, and thus x = x’ or x = —x’. This proves our theorem. 


We conclude that, with the corresponding definitions for area equality, it is true 
in the theory axiomatized by the fundamental assumption and {A1—A4, M, ~P} that 


Theorem 3.2. If ABC and AB'C have the same area and (U, W) and (U', W’) are 
their respective midlines, then they either coincide or one is the reflection of the other 
in {A,C). 


Acknowledgement The author thanks Vincent Alberge for a very close reading of 
the manuscript and for very useful suggestions for improvement. 
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